Cyclin-dependent protein kinase

ABSTRACT

An isolated nucleic acid molecule is disclosed which encodes a novel human cyclin-dependent kinase (CDK) which comprises a novel cyclin binding domain signature sequence and lacks several heretofore conserved amino acid residues involved in regulation of the cdk/cyclin complex. Associated proteins and biologically active mutant forms are also disclosed.

CROSS-REFERENCE TO RELATED APPLICATIONS

Provisional Application U.S. Ser. No. 60/037,855 filed Feb. 7, 1997.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH

Not applicable.

REFERENCE TO MICROFICHE APPENDIX

Not applicable.

FIELD OF THE INVENTION

The present invention relates to an isolated nucleic acid molecule(polynucleotide) which encodes a novel human cyclin-dependent kinase(CDK) comprising a novel cyclin binding domain signature sequence andlacking several heretofore conserved amino acid residues involved inregulation of the cdk/cyclin complex. The present invention also relatesto associated human CDK proteins and human CDK mutant proteins.

BACKGROUND OF THE INVENTION

Cell growth and division in eukaryotic organisms is mediated through thecell cycle. The cell cycle consists of two major events separated by twocentral gap phases. DNA synthesis and replication occur during the Sphase while mitosis occurs during the M phase. A first gap phase, calledG₁, which occurs between the M phase and the S phase, allows foraccumulation of enzymes and other compounds necessary to drive DNAsynthesis and genome replication. A second gap phase, called G₂, occursbetween the S phase and the M phase, allowing for controls to check forproper DNA replication prior to committing to cell division. Transitionto and passage through the four stages of the eukaryotic cell cycle areregulated by a family of cyclin-dependent protein kinases (CDKs).Activation of a CDK requires binding to a cyclin regulatory subunit, andin the case of CDK1-CDK6, phosophorylation of threonine 160/161(Thr160/161). These CDKs contain a cyclin binding site near the aminoterminal portion of the protein. The activated CDK/cyclin complexphosphorylates proteins involved in various stages of the cell cycle.

The family of cyclin proteins may generally be classified as either G₁cyclins or mitotic cyclins, depending on peak expression levels. A CDKmay bind a subset of cyclins. For example, CDK4 is known to bind cyclinD1 or cyclin D3 whereas CDK2 is known to bind cyclin A, cyclin B1,cyclin B2, cyclin B3 and cyclin E. The vertebrate cyclins show homologywithin a region of approximately 100 amino acids, referred to as thecyclin box. This region is responsible for CDK binding and activity(Kobayashi, et al., 1992, Molec. Biol. Cell. 3: 1279-1294; Lees, et al.,1993, Molec. Cell. Biol., 1993, 13: 1194-1201). It is this region of thecyclin protein which interacts with the cyclin binding domain of arespective CDK protein.

Complete activation of a known CDK/cyclin complex requiresphosphorlyation by a CDK-Activating Kinase (CAK). The vertebrate CAK hasbeen identified as a CDK/cyclin complex, more specifically CDK7/cyclinH(Fisher and Morgan, 1994, Cell 78: 713-724). The CAK enzyme comprises athreonine 170 residue (in human CDK7) which has been shown to berequired for optimal activity (Poon, et al., 1994, J. Cell Sci. 107:2789-2799; Fisher and Morgan, 1994, Cell 78: 713-724).

Inhibition of CDK/cyclin complexes are thought to occur viaphosphorylation at threonine 14 (Thr14) and/or tyrosine 15 (Tyr15) ofthe CDK subunit. The Weel kinase has been suggested as either a Thr14kinase or as a Thr14 and Tyr15 kinase. Additionally, CDC25 is thought tobe a dual kinase targeting both Thr14 and/or Tyr15 (Morgan, 1995, Nature374: 131-134).

It would be advantageous to identify a gene encoding an additional CDKprotein. A nucleic acid molecule expressing a CDK protein would beextremely useful in screening for compounds acting as a modulator of thecell cycle. Such a compound or compounds will be useful in controllingcell growth associated with cancer or immune cell proliferation.Additionally, the recombinant form of protein expressed from such anovel gene would be useful for an in vitro assay to determinespecificity toward substrate proteins, inhibitors and cyclin activators.Additionally, an isolated and purified CDK10 cDNA which encodes CDK-10or an active mutant thereof will also be useful for the recombinantproduction of large quantities of respective protein. The ability toproduce large quantities of the protein would be useful for theproduction of a therapeutic agent comprising the CDK10 protein or amutant such as the exemplified mutant disclosed herein. A therapeuticagent comprised of CDK10 protein would be useful in the treatment ofcell cycle and/or CDK10 related diseases or conditions which are CDK10responsive. The present invention addresses and meets this need.

SUMMARY OF THE INVENTION

The present invention relates to an isolated nucleic acid molecule(polynucleotide) which encodes a novel human cyclin-dependent kinase.This CDK comprises a novel cyclin binding domain signature sequence(Pro-Asn-Gln-Ala-Leu-Arg-Glu; SEQ ID NO:1), lacks Thr14 and/or Tyr15,and also lacks the T-loop domain containing the conserved Thr160/161residue.

The present invention relates to biologically active fragments ormutants of a novel isolated nucleic acid molecule which encodes mRNAexpressing a novel human cyclin-dependent kinase. Any such biologicallyactive fragment and/or mutant will encode a protein or protein fragmentcomprising a novel cyclin binding domain signature sequence(Pro-Asn-Gln-Ala-Leu-Arg-Glu; SEQ ID NO:1), which lacks Thr14 and/orTyr15 as well as a T-loop domain containing the conserved Thr160/161residue. Any such polynucleotide includes but is not necessarily limitedto nucleotide substitutions, deletions, additions, amino-terminaltruncations and carboxy-terminal truncations such that these mutationsencode mRNA which express a protein or protein fragment of diagnostic,therapeutic or prophylactic use.

The isolated nucleic acid molecule of the present invention may includea deoxyribonucleic acid molecule (DNA), such as genomic DNA andcomplementary DNA (cDNA), which may be single (coding or noncodingstrand) or double stranded, as well as synthetic DNA, such as asynthesized, single stranded polynucleotide. The isolated nucleic acidmolecule of the present invention may also include a ribonucleic acidmolecule (RNA).

A preferred aspect of the present invention is disclosed in SEQ ID NO:11and FIG. 1, a human DNA fragment which encodes the novel humancyclin-dependent kinase, CDK10.

The present invention also relates to a substantially purified novelcyclin-dependent kinase which comprises a novel cyclin binding domainsignature sequence (Pro-Asn-Gln-Ala-Leu-Arg-Glu; SEQ ID NO:1), lacksThr14 and Tyr15 which make up the conserved ATP binding motif of severalknown CKDs, and also lacks the T-loop domain containing the conservedThr160/161 residue.

The present invention also relates to biologically active fragmentsand/or mutants of a novel cyclin-dependent kinase which comprises anovel cyclin binding domain signature sequence, lacks Thr14 and/or Tyr15which make up the conserved ATP binding motif of known CKDs, and alsolacks the T-loop domain containing the conserved Thr160/161 residue,including but not necessarily limited to amino acid substitutions,deletions, additions, amino terminal truncations and carboxy-terminaltruncations such that these mutations provide for proteins or proteinfragments of diagnostic, therapeutic or prophylactic use.

A preferred aspect of the present invention is disclosed in SEQ ID NO:3and FIG. 2, the amino acid sequence of CDK10. The open reading frame ofthe CDK10 coding region runs from nucleotide 210 to nucleotide 1182 ofSEQ ID NO:2.

Another preferred aspect of the present invention is disclosed in SEQ IDNO:11, wherein nucleotide 588 of the wild-type form (SEQ ID NO: 2) ismutated from "G" to "A".

Another preferred aspect of the present invention is the mutant protein,(CDK10-D127N), wherein nucleotide 588 of SEQ ID NO:11 is mutated from"G" to "A", as compared to the wild-type form SEQ ID NO:2), whichresults in a change of Asp127 to Asn127 as compared to the wild-typeamino acid sequence (SEQ ID NO:3), disclosed as SEQ ID NO:12.

The present invention also relates to methods of expressing hecyclin-dependent kinases disclosed herein, assays employing thesecyclin-dependent kinases, cells expressing these cyclin-dependentkinases, and compounds identified through the use of thesecyclin-dependent kinases, including modulators of the cyclin-dependentskinase either through direct contact with the cyclin-dependent kinase,an associated cyclin, or the CKD/cyclin complex. Such modulatorsidentified in this process are useful as therapeutic agents forcontrolling cell growth or immune cell proliferation commonly associatedwith cancer.

DETAILED DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the nucleotide sequence (SEQ ID NO:2) which comprises thefull length cDNA encoding human CDK10.

FIG. 2 shows the amino acid sequence (SEQ ID NO:3) of human CDK10.

FIG. 3 shows the strategy utilized to generate a full-length DNAfragment encoding human CDK10.

FIG. 4 shows northern blot analysis of human tissue mRNA hybridized to a³² P-labeled probe from the 3' region of the DNA fragment encoding humanCDK10.

FIG. 5 shows northern blot analysis of human tissue mRNA hybridized to a³² P-labeled probe from the 3' region of the DNA fragment encoding humanCDK10.

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to an isolated nucleic acid molecule(polynucleotide) which encodes a novel cyclin-dependent kinase whichcomprises a novel human cyclin binding domain(Pro-Asn-Gln-Ala-Leu-Arg-Glu; SEQ ID NO:1), lacks Thr14 and/or Tyr15which make up the conserved ATP binding motif of known CDKs, and alsolacks the T-loop domain containing the conserved Thr160/161 residue.

The present invention also relates to biologically active fragmentsand/or mutants of a novel isolated nucleic acid molecule which encodemRNA expressing a novel human cyclin-dependent kinase. Such a proteincomprises a novel cyclin binding domain signature sequence(Pro-Asn-Gln-Ala-Leu-Arg-Glu; SEQ ID NO:1), lacks Thr14 and/or Tyr15,and also lack a T-loop domain containing the conserved Thr160/161residue. The protein of the present invention includes but is notlimited to nucleotide substitutions, deletions, additions, aminoterminal truncations and carboxy-terminal truncations such that thesemutations encode mRNA which express a protein or protein fragment ofdiagnostic, therapeutic or prophylactic use.

A preferred aspect of the present invention is disclosed in FIG. 1 andSEQ ID NO:2, a human cDNA encoding a novel cyclin-dependent kinase,CDK10, disclosed herein as:

                   GAAAAGGCGC AGTGGGGCCC GGAGCTGTCA CCCCTGACTC GACGCAGCTT                        CCGTTCTCCT                              (SEQ ID NO:2)                                                                   - GGTGACGTCG                                                                CCTACAGGAA                                                                    CCGCCCCAGT                                                                    GGTCAGCTGC                                                                    CGCGCTGTTG                                                                    CTAGGCAACA                                                                      - GCGTGCGAGC                                                                TCAGATCAGC                                                                    GTGGGGTGGA                                                                    GGAGAAGTGG                                                                    AGTTTGGAAG                                                                    TTCAGGGGCA                                                                      - CAGGGGCACA                                                                GGCCCACGAC                                                                    TGCAGCGGGA                                                                    TGGACCAGTA                                                                    CTGCATCCTG                                                                    GGCCGCATCG                                                                      - GGGAGGGCGC                                                                CCACGGCATC                                                                    GTCTTCAAGG                                                                    CCAAGCACGT                                                                    GGAGACTGGC                                                                    GAGATAGTTG                                                                      - CCCTCAAGAA                                                                GGTGGCCCTA                                                                    AGGCGGTTGG                                                                    AAGACGGCTT                                                                    CCCTAACCAG                                                                    GCCCTGCGGG                                                                      - AGATTAAGGC                                                                TCTGCAGGAG                                                                    ATGGAGGACA                                                                    ATCAGTATGT                                                                    GGTACAACTG                                                                    AAGGCTGTGT                                                                      - TCCCACACGG                                                                TGGAGGCTTT                                                                    GTGCTGGCCT                                                                    TTGAGTTCAT                                                                    GCTGTCGGAT                                                                    CTGGCCGAGG                                                                      - TGGTGCGCCA                                                                TGCCCAGAGG                                                                    CCACTAGCCC                                                                    AGGCACAGGT                                                                    CAAGAGCTAC                                                                    CTGCAGATGC                                                                      - TGCTCAAGGG                                                                TGTCGCCTTC                                                                    TGCCATGCCA                                                                    ACAACATTGT                                                                    ACATCGGGAC                                                                    CTGAAACCTG                                                                      - CCAACCTGCT                                                                CATCAGCGCC                                                                    TCAGGCCAGC                                                                    TCAAGATAGC                                                                    GGACTTTGGC                                                                    CTGGCTCGAG                                                                      - TCTTTTCCCC                                                                AGACGGCAGC                                                                    CGCCTCTACA                                                                    CACACCAGGT                                                                    GGCCACCAGG                                                                    TCTGTGGGCT                                                                      - GCATCATGGG                                                                GGAGCTGTTG                                                                    AATGGGTCCC                                                                    CCCTTTTCCC                                                                    GGGCAAGAAC                                                                    GATATTGAAC                                                                      -                    AGCTTTGCTA TGTGCTTCGC ATCTTGGGCA CCCCAAACCC TCAAGTCTGG CCGGAGCTCA               - CTGAGCTGCC GGACTACAAC AAGATCTCCT TTAAGGAGCA GGTGCCCATG CCCCTGGAGG           - AGGTGCTGCC TGACGTCTCT CCCCAGGCAT TGGATCTGCT GGGTCAATTC CTTCTCTACC           - CTCCTCACCA GCGCATCGCA GCTTCCAAGG CTCTCCTCCA TCAGTACTTC TTCACAGCTC           - CCCTGCCTGC CCATCCATCT GAGCTGCCGA TTCCTCAGCG TCTAGGGGGA CCTGCCCCCA           - AGGCCCATCC AGGGCCCCCC CACATCCATG ACTTCCACGT GGACCGGCCT CTTGAGGAGT           - CGCTGTTGAA CCCAGAGCTG ATTCGGCCCT TCATCCTGGA GGGGTGAGAA GTTGGCCCTG           - GTCCCGTCTG CCTGCTCCTC AGGACCACTC AGTCCACCTG TTCCTCTGCC ACCTGCCTGG           - CTTCACCCTC CAAGGCCTCC CCATGGCCAC AGTGGGCCCA CACCACACCC TGCCCCTTAG           - CCCTTGCGAG GGTTGGTCTC GAGGCAGAGG TCATGTTCCC AGCCAAGAGT ATGAGAACAT           - CCAGTCGAGC AGAGGAGATT CATGGCCTGT GCTCGGTGAG CCTTACCTTC TGTGTGCTAC           - TGACGTACCC ATCAGGACAG TGAGCTCTGC TGCCAGTCAA GGCCTGCATA TGCAGAATGA           - CGATGCCTGC CTTGGTGCTG CTTCCCCGAG TGCTGCCTCC TGGTCAAGGA GAAGTGCAGA           - GAGTAAGGTG TCCTTATGTT GGAAACTCAA GTGGAAGGAA GATTTGGTTT GGTTTTATTC           - TCAGAGCCAT TAAACACTAG TTCAGTATGT GAGATATAGA TTCTAAAAAC CTCAGGTGGC           - TCTGCCTTAT GTCTGTTCTT CCTTCATTTC TCTCAAGGGA AATGGCTAAG GTGGCATTGT           - CTCATGGCTC TCGTTTTTGG GGTCATGGGG AGGGTAGCAC CAGGCATAGC CACTTTTGCC           - CTGAGGGACT CCTGTGTGCT TCACATCACT GAGCACTCAT TTAGAAGTGA GGGAGACAGA           - AGTCTAGGCC CAGGGATGGC TCCAGTTGGG GATCCAGCAG GAGACCCTCT GCACATGAGG           - CTGGTTTACC AACATCTACT CCCTCAGGAT GAGCGTGAGC CAGAAGCAGC TGTGTATTTA           - AGGAAACAAG CGTTCCTGGA ATTAATTTAT AAATTTAATA AATCCCAATA TAATCCCAAA           - AAAAAAAAAA AAAAAATTCC TGCGGCCGCA AGGA.                                

The present invention also relates to a substantially purified novelcyclin-dependent kinase which comprises a novel cyclin binding domainsignature sequence (Pro-Asn-Gln-Ala-Leu-Arg-Glu; SEQ ID NO:1), lacksThr14 and/or Tyr15 as well as the T-loop domain containing the conservedThr160/161 residue. Any such nucleic acid may be isolated andcharacterized from a mammalian cell, including but not limited to human,human and rodent. A human form is an especially preferred form, such asthe isolated cDNA exemplified herein as set forth in SEQ ID NO:2 and adominant negative mutant form as set forth in SEQ ID NO:12.

The present invention also relates to biologically active fragmentsand/or mutants of a novel cyclin-dependent kinase which comprises thenovel cyclin binding domain (Pro-Asn-Gln-Ala-Leu-Arg-Glu; SEQ ID NO:1),lacks Thr14 and/or Tyr15 which make up the conserved ATP binding motifof known CDKs, and also lacks the T-loop domain containing the conservedThr160/161 residue, including but not necessarily limited to amino acidsubstitutions, deletions, additions, amino terminal truncations andcarboxy-terminal truncations such that these mutations provide forproteins or protein fragments of diagnostic, therapeutic or prophylacticuse. Any such nucleic acid may be isolated and characterized from amammalian cell, including but not limited to human, human and rodent,with a human form being an especially preferred form.

A preferred aspect of the present invention is disclosed in SEQ ID NO:3and FIG. 2, the amino acid sequence of CDK10. The open reading frame ofthe CDK10 coding region runs from nucleotide 210 to nucleotide 1182 ofSEQ ID NO:2. The amino acid sequence of the novel cyclin-dependentkinase, CDK10, is disclosed herein as:

                         MDQYCILGRI GEGAHGIVFK AKHVETGEIV ALKKVALRRL EDGFPNQAL                         R                                 (SEQ ID NO:3)                                                                   - EIKALQEMED                                                                NQYVVQLKAV                                                                    FPHGGGFVLA                                                                    FEFMLSDLAE                                                                    VVRHAQRPLA                                                                      - QAQVKSYLQM                                                                LLKGVAFCHA                                                                    NNIVHRDLKP                                                                    ANLLISASGQ                                                                    LKIADFGLAR                                                                      - VFSPDGSRLY                                                                THQVATRSVG                                                                    CIMGELLNGS                                                                    PLFPGKNDIE                                                                    QLCYVLRILG                                                                      - TPNPQVWPEL                                                                TELPDYNKIS                                                                    FKEQVPMPLE                                                                    EVLPDVSPQA                                                                    LDLLGQFLLY                                                                      - PPHQRIAASK                                                                ALLHQYFFTA                                                                    PLPAHPSELP                                                                    IPQRLGGPAP                                                                    KAHPGPPHIH                                                                      - DFHVDRPLEE                                                                SLLNPELIRP FILEG.  

Another preferred aspect of the present invention is disclosed in SEQ IDNO:11, wherein nucleotide 588 of the wild-type form (SEQ ID NO: 2) ismutated from "G" to "A".

Another preferred aspect of the present invention is the mutant protein,(CDK10-D127N), wherein nucleotide 588 of SEQ ID NO:11 is mutated from"G" to "A", as compared to the wild-type form (SEQ ID NO:2), whichresults in a change of Asp127 to Asn127 as compared to the wild-typeamino acid sequence (SEQ ID NO:3), disclosed as SEQ ID NO:12.

The present invention also relates to methods of expressing thecyclin-dependent kinases disclosed herein, assays employing thesecyclin-dependent kinases, cells expressing these cyclin-dependentkinases, and compounds identified through the use of thesecyclin-dependent kinases, including modulators of the cyclin-dependentskinase either through direct contact with the cyclin-dependent kinase,an associated cyclin, or the CKD/cyclin complex. Such modulatorsidentified in this process are useful as therapeutic agents forcontrolling cell growth or immune cell proliferation associated withhuman cancers. Additionally, an isolated and purified CDK10 cDNA whichencodes CDK-10 or an active mutant thereof will also be useful for therecombinant production of large quantities of respective protein. Theability to produce large quantities of the protein would be useful forthe production of a therapeutic agent comprising the CDK10 protein or amutant such as the exemplified mutant disclosed herein. A therapeuticagent comprised of CDK10 protein would be useful in the treatment ofcell cycle and/or CDK10 related diseases or conditions which are CDK10responsive or possibly a therapeutic agent comprised of a mutant,including but not limited to CDK10-D127N, which may be useful in thetreatment of cell cycle diseases or conditions which are responsive tothe regulatory effects of the mutant kinase.

The isolated nucleic acid molecule of the present invention may includea deoxyribonucleic acid molecule (DNA), such as genomic DNA andcomplementary DNA (cDNA), which may be single (coding or noncodingstrand) or double stranded, as well as synthetic DNA, such as asynthesized, single stranded polynucleotide. The isolated nucleic acidmolecule of the present invention may also include a ribonucleic acidmolecule (RNA).

It is known that there is a substantial amount of redundancy in thevarious codons which code for specific amino acids. Therefore, thisinvention is also directed to those DNA sequences which containalternative codons which code for the eventual translation of theidentical amino acid. For purposes of this specification, a sequencebearing one or more replaced codons will be defined as a degeneratevariation. Also included within the scope of this invention aremutations either in the DNA sequence or the translated protein which donot substantially alter the ultimate physical properties of theexpressed protein. For example, substitution of valine for leucine,arginine for lysine, or asparagine for glutamine may not cause a changein functionality of the polypeptide. Therefore, this invention is alsodirected to those DNA sequences which express RNA comprising alternativecodons which code for the eventual translation of the identical aminoacid, as shown below:

A=Ala=Alanine: codons GCA, GCC, GCG, GCU

C=Cys=Cysteine: codons UGC, UGU

D=Asp=Aspartic acid: codons GAC, GAU

E=Glu=Glutamic acid: codons GAA, GAG

F=Phe=Phenylalanine: codons UUC, UUU

G=Gly=Glycine: codons GGA, GGC, GGG, GGU

H=His=Histidine: codons CAC, CAU

I=Ile=Isoleucine: codons AUA, AUC, AUU

K=Lys=Lysine: codons AAA, AAG

L=Leu=Leucine: codons UUA, UUG, CUA, CUC, CUG, CUU

M=Met=Methionine: codon AUG

N=Asp=Asparagine: codons AAC, AAU

P=Pro=Proline: codons CCA, CCC, CCG, CCU

Q=Gln=Glutamine: codons CAA, CAG

R=Arg=Arginine: codons AGA, AGG, CGA, CGC, CGG, CGU

S=Ser=Serine: codons AGC, AGU, UCA, UCC, UCG, UCU

T=Thr=Threonine: codons ACA, ACC, ACG, ACU

V=Val=Valine: codons GUA, GUC, GUG, GUU

W=Trp=Tryptophan: codon UGG

Y=Tyr=Tyrosine: codons UAC, UAU

Therefore, the present invention discloses codon redundancy which mayresult in differing DNA molecules expressing an identical protein. Forpurposes of this specification, a sequence bearing one or more replacedcodons will be defined as a degenerate variation. Also included withinthe scope of this invention are mutations either in the DNA sequence orthe translated protein which do not substantially alter the ultimatephysical properties of the expressed protein. For example, substitutionof valine for leucine, arginine for lysine, or asparagine for glutaminemay not cause a change in functionality of the polypeptide.

It is known that DNA sequences coding for a peptide may be altered so asto code for a peptide having properties that are different than those ofthe naturally occurring peptide. Methods of altering the DNA sequencesinclude but are not limited to site directed mutagenesis. Examples ofaltered properties include but are not limited to changes in theaffinity of an enzyme for a substrate or a receptor for a ligand.

As used herein, a "biologically active equivalent" or "functionalderivative" of a wild type CDK possesses a biological activity that issubstantially similar to the biological activity of the wild type CDK10protein. The term "functional derivative" is intended to include the"fragments," "mutants," "variants," "degenerate variants," "analogs" and"homologues" or to "chemical derivatives" of the wild type CDK10protein. The term "fragment" is meant to refer to any polypeptide subsetof wild type CDK10. The term "mutant" is meant to refer to a moleculethat may be substantially similar to the wild type form but possessesdistinguishing biological characteristics. Such altered characteristicsinclude but are in no way limited to altered enzymatic activity, alteredcyclin binding altered substrate binding, altered substrate affinity andaltered sensitivity to chemical compounds affecting biological activity.An exemplified mutant is CDK10-D127N, wherein a single base mutation atnucleotide 588 of SEQ ID NO:2 results in a single amino acidsubstitution at residue 127, from aspartic acid to asparagine. Thismutation alters kinase activity of CDK10-D127N as compared to the wildtype CDK10 protein. The term "variant" is meant to refer to a moleculesubstantially similar in structure and function to either the entirewild type protein or to a fragment thereof. A molecule is "substantiallysimilar" to a wild type CDK10-like protein if both molecules havesubstantially similar structures or if both molecules possess similarbiological activity. Therefore, if the two molecules possesssubstantially similar activity, they are considered to be variants evenif the structure of one of the molecules is not found in the other oreven if the two amino acid sequences are not identical.

The term "analog" refers to a molecule substantially similar in functionto either the entire wild type CDK10-like protein or to a fragmentthereof.

"Substantial homology" or "substantial similarity", when referring tonucleic acids means that the segments or their complementary strands,when optimally aligned and compared, are identical with appropriatenucleotide insertions or deletions, in at least 75% of the nucleotides.Alternatively, substantial homology exists when the segments willhybridize to a strand or its complement.

The term "substantial homology", when referring to polypeptides,indicates that the polypeptide or protein in question exhibits at leastabout 30% homology with the naturally occurring protein in question,usually at least about 65% homology.

The nucleic acids claimed herein may be present in whole cells or incell lysates or in a partially purified or substantially purified form.A nucleic acid is considered substantially purified when it is purifiedaway from environmental contaminants. Thus, a nucleic acid sequenceisolated from cells is considered to be substantially purified whenpurified from cellular components by standard methods while a chemicallysynthesized nucleic acid sequence is considered to be substantiallypurified when purified from its chemical precursors.

Any of a variety of procedures may be used to clone CDK10. These methodsinclude, but are not limited to, (1) a RACE PCR cloning technique(Frohman, et al., 1988, Proc. Natl. Acad. Sci. 85: 8998-9002). 5' and/or3' RACE may be performed to generate a full length cDNA sequence. Thisstrategy involves using gene-specific oligonucleotide primers for PCRamplification of CDK10 cDNA. These gene-specific primers are designedthrough identification of an expressed sequence tag (EST) nucleotidesequence which has been identified by searching any number of publiclyavailable nucleic acid and protein databases; (2) direct functionalexpression of the CDK10 cDNA following the construction of anCDK10-containing cDNA library in an appropriate expression vectorsystem; (3) screening a CDK10-containing cDNA library constructed in abacteriophage or plasmid shuttle vector with a labeled degenerateoligonucleotide probe designed from the amino acid sequence of the CDK10protein; (4) screening a CDK10-containing cDNA library constructed in abacteriophage or plasmid shuttle vector with a partial cDNA encoding theCDK10 protein. This partial cDNA is obtained by the specific PCRamplification of CDK10 DNA fragments through the design of degenerateoligonucleotide primers from the amino acid sequence known for other CDKkinases which are related to the CDK10 protein; (5) screening anCDK10-containing cDNA library constructed in a bacteriophage or plasmidshuttle vector with a partial cDNA encoding the CDK10 protein. Thisstrategy may also involve using gene-specific oligonucleotide primersfor PCR amplification of CDK10 cDNA identified as an EST as describedabove; or (6) designing 5' and 3' gene specific oligonucleotides usingSEQ ID NO:2 as a template so that either the full length cDNA may begenerated by known RACE techniques, or a portion of the coding regionmay be generated by these same known RACE techniques to generate andisolate a portion of the coding region to use as a probe to screen oneof numerous types of cDNA and/or genomic libraries in order to isolate afull length version of the nucleotide sequence encoding CDK10.

It is readily apparent to those skilled in the art that other types oflibraries, as well as libraries constructed from other cells types orspecies types, may be useful for isolating a CDK10-encoding DNA or aCDK10 homologue. Other types of libraries include, but are not limitedto, cDNA libraries derived from other cells or cell lines other thanhuman cells or tissue such as murine cells, rodent cells or any othersuch vertebrate host which may contain a CDK10-encoding DNA.Additionally a CDK10 gene may be isolated by oligonucleotide- orpolynucleotide-based hybridization screening of a vertebrate genomiclibrary, including but not limited to a human genomic library, a murinegenomic library and a rodent genomic library, as well as concomitanthuman genomic DNA libraries.

It is readily apparent to those skilled in the art that suitable cDNAlibraries may be prepared from cells or cell lines which have CDK10activity. The selection of cells or cell lines for use in preparing acDNA library to isolate a CDK10 cDNA may be done by first measuring cellassociated CDK10 activity using any known assay for CDK activity.

Preparation of cDNA libraries can be performed by standard techniqueswell known in the art. Well known cDNA library construction techniquescan be found for example, in Sambrook, et al., 1989, Molecular Cloning:A Laboratory Manual; Cold Spring Harbor Laboratory, Cold Spring Harbor,N.Y. Complementary DNA libraries may also be obtained from numerouscommercial sources, including but not limited to Clontech Laboratories,Inc. and Stratagene.

It is also readily apparent to those skilled in the art that DNAencoding CDK10 may also be isolated from a suitable genomic DNA library.Construction of genomic DNA libraries can be performed by standardtechniques well known in the art. Well known genomic DNA libraryconstruction techniques can be found in Sambrook, et al., supra.

In order to clone the CDK10 gene by one of the preferred methods, theamino acid sequence or DNA sequence of CDK10 or a homologous protein maybe necessary. To accomplish this, the CDK10 or a homologous protein maybe purified and partial amino acid sequence determined by automatedsequenators. It is not necessary to determine the entire amino acidsequence, but the linear sequence of two regions of 6 to 8 amino acidscan be determined for the PCR amplification of a partial CDK10 DNAfragment. Once suitable amino acid sequences have been identified, theDNA sequences capable of encoding them are synthesized. Because thegenetic code is degenerate, more than one codon may be used to encode aparticular amino acid, and therefore, the amino acid sequence can beencoded by any of a set of similar DNA oligonucleotides. Only one memberof the set will be identical to the CDK10 sequence but others in the setwill be capable of hybridizing to CDK10 DNA even in the presence of DNAoligonucleotides with mismatches. The mismatched DNA oligonucleotidesmay still sufficiently hybridize to the CDK10 DNA to permitidentification and isolation of CDK10 encoding DNA. Alternatively, thenucleotide sequence of a region of an expressed sequence may beidentified by searching one or more available genomic databases.Gene-specific primers may be used to perform PCR amplification of a cDNAof interest from either a cDNA library or a population of cDNAs. Asnoted above, the appropriate nucleotide sequence for use in a PCR-basedmethod may be obtained from SEQ ID NO:2, either for the purpose ofisolating overlapping 5' and 3' RACE products for generation of afull-length sequence coding for CDK10, or to isolate a portion of thenucleotide sequence coding for CDK10 for use as a probe to screen one ormore cDNA- or genomic-based libraries to isolate a full-length sequenceencoding CDK10 or CDK10-like proteins.

In an exemplified method, the RACE PCR technique (Frohman, et al., 1988,Proc. Natl. Acad. Sci 85: 8998-9002) is used for cloning a 5' codingregion of CDK10 encoding DNA. First round PCR used adapter-ligated humanplacenta cDNA template (from Clontech), gene-specific primer PK22L234,(5'-TGATGCAGCCCACAGACCTG-3'; SEQ ID NO: 4) and an adapter primer AP1(5'-CCATCCTAATACGACTCACTATAGGGC-3'; SEQ ID NO:5). PCR amplification wasperformed using the Elongase™. Thermal cycling was completed and aportion of this first PCR reaction was added to a second PCR reaction asDNA template. This PCR reaction also differed from the first PCRreaction in that the nested gene specific primer PK22L161

(5'-GCCGTCTGGGGAAAAGA-3'; SEQ ID NO:6) and the nested adapter primer AP2

(5'-ACTCACTATAGGGCTCGAGCGGC-3', SEQ ID NO:7) were utilized.

An approximately 600 bp DNA product was identified from a 1% agaroseelectrophoresis gel, excised, and purified using a Qiagen PCR-spuncolumn (Qiaquick™). This fragment was used directly for DNA sequencingusing PK22L161 and AP2 primers, and for cloning into pCR2.1 using theInvitrogen TA-cloning kit.

A DNA fragment 3' to and overlapping the 600 bp 5' fragment wasidentified by searching public nucleic acid and protein databases. This3' fragment is an approximately 1.8 Kb cDNA insert available as aNotI-HindIII fragment in a typical phagemid vector. This cDNA clone isreadily identified by Genbank Accession No. H17727, Image Clone ID No.50484, Washington University Clone ID No. ym40a06, and GBD Clone ID No.423294. This cDNA was isolated from a library constructed from humaninfant brain mRNA. This construct is available from Research Genetics,Inc., 2130 Memorial Parkway SW, Hunstville, Ala. 35801 (http://www.resgen.com).

A full length CDK10 coding region was assembled in pLITMUS28 (NewEngland Biolabs) as an expression cassette with a BamHI site appendedjust 5' to the ATG translational start codon. A BamHI-XbaI fragmentbearing CDK10 was recloned into pcDNA3.1 expression vector (Invitrogen)and a BamHI-NcoI fragment bearing CDK10 was recloned into pBlueBacHis2baculovirus expression vector (Invitrogen). A similar construct wasgenerated which contains dominant-negative single base pair mutation ofCDK10. This mutant was generated from pLITMUS28::CDK10 using theStratagene "Quik Change" kit and primers 22U-D127N

(5'-CAACATTGTACATCGGAACCTGAAACCTGCC-3'; SEQ ID NO: 8) and 22L-D127N

(5'-GGCAGGTTTCAGGTTCC-GATGTACAATGTTG-3'; SEQ ID NO: 9). Both mutantconstructions were subcloned into pcDNA3.1 (as a BamHI-XbaI fragment)and pBlueBacHis2 (as a BamHI-NcoI fragment), respectively.

The sequence for the 5' upstream sequences, coding region and 3'untranslated sequences for the human full-length cDNA encoding CDK10 isshown in SEQ ID NO:2. The deduced amino acid sequence of CDK10 from thecloned cDNA is shown in SEQ ID NO:3. Inspection of the determined cDNAsequence reveals the presence of a single open reading frame thatencodes a 325 amino acid protein. The open reading frame of the CDK10coding region runs from nucleotide 210 to nucleotide 1182 of SEQ IDNO:2.

The nucleotide sequence which encodes a preferred mutant form (Asp127 toAsn127), is disclosed as SEQ ID NO:11.

The amino acid sequence for this preferred mutant form, CDK10-D127N, isdisclosed in SEQ ID NO:12.

A variety of mammalian expression vectors may be used to expressrecombinant CDK10 in mammalian cells. Expression vectors are definedherein as DNA sequences that are required for the transcription ofcloned DNA and the translation of their mRNAs in an appropriate host.Such vectors can be used to express eukaryotic DNA in a variety of hostssuch as bacteria, blue green algae, plant cells, insect cells and animalcells. Specifically designed vectors allow the shuttling of DNA betweenhosts such as bacteria-yeast or bacteria-animal cells. An appropriatelyconstructed expression vector should contain: an origin of replicationfor autonomous replication in host cells, selectable markers, a limitednumber of useful restriction enzyme sites, a potential for high copynumber, and active promoters. A promoter is defined as a DNA sequencethat directs RNA polymerase to bind to DNA and initiate RNA synthesis. Astrong promoter is one which causes mRNAs to be initiated at highfrequency. Expression vectors may include, but are not limited to,cloning vectors, modified cloning vectors, specifically designedplasmids or viruses.

Commercially available mammalian expression vectors which may besuitable for recombinant CDK10 expression, include but are not limitedto, pcDNA3.1 (Invitrogen), pBlueBacHis2 (Invitrogen), pLITMUS28,pLITMUS29, pLITMUS38 and pLITMUS39 (New England Bioloabs), pcDNAI,pcDNAIamp (Invitrogen), pcDNA3 (Invitrogen), pMC1neo (Stratagene), pXT1(Stratagene), pSG5 (Stratagene), EBO-pSV2-neo (ATCC 37593) pBPV-1(8-2)(ATCC 37110), pdBPV-MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199),pRSVneo (ATCC 37198), pSV2-dhfr (ATCC 37146), pUCTag (ATCC 37460), andλZD35 (ATCC 37565).

A variety of bacterial expression vectors may be used to expressrecombinant CDK10 in bacterial cells. Commercially available bacterialexpression vectors which may be suitable for recombinant CDK10expression include, but are not limited to pCR2.1 (Invitrogen), pET11a(Novagen), lambda gt11 (Invitrogen), pcDNAII (Invitrogen), pKK223-3(Pharmacia).

A variety of fungal cell expression vectors may be used to expressrecombinant CDK10 in fungal cells. Commercially available fungal cellexpression vectors which may be suitable for recombinant CDK10expression include but are not limited to pYES2 (Invitrogen), Pichiaexpression vector (Invitrogen).

A variety of insect cell expression vectors may be used to expressrecombinant receptor in insect cells. Commercially available insect cellexpression vectors which may be suitable for recombinant expression ofCDK10 include but are not limited to pBlueBacIII and pBlueBacHis2(Invitrogen).

The expression vector may be introduced into host cells via any one of anumber of techniques including but not limited to transformation,transfection, lipofection, protoplast fusion, and electroporation. Theexpression vector-containing cells are clonally propagated andindividually analyzed to determine whether they produce CDK10 protein.Identification of CDK10 expressing host cell clones may be done byseveral means, including but not limited to immunological reactivitywith anti-CDK10 antibodies.

Expression of CDK10 DNA may also be performed using in vitro producedsynthetic mRNA or native mRNA. Synthetic mRNA or mRNA isolated fromCDK10 producing cells can be efficiently translated in various cell-freesystems, including but not limited to wheat germ extracts andreticulocyte extracts, as well as efficiently translated in cell basedsystems, including but not limited to microinjection into frog oocytes,with microinjection into frog oocytes being preferred.

An expression vector containing DNA encoding a CDK10-like protein may beused for expression of CDK10 in a recombinant host cell. Recombinanthost cells may be prokaryotic or eukaryotic, including but not limitedto bacteria such as E. coli, fungal cells such as yeast, mammalian cellsincluding but not limited to cell lines of human, bovine, porcine,monkey and rodent origin, and insect cells including but not limited toDrosophila and silkworm derived cell lines. Cell lines derived frommammalian species which may be suitable and which are commerciallyavailable, include but are not limited to, L cells L-M(TK⁻) (ATCC CCL1.3), L cells L-M (ATCC CCL 1.2), Saos-2 (ATCC HTB-85), 293 (ATCC CRL1573), Raji (ATCC CCL 86), CV-1 (ATCC CCL 70), COS-1 (ATCC CRL 1650),COS-7 (ATCC CRL 1651), CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), NIH/3T3(ATCC CRL 1658), HeLa (ATCC CCL 2), C1271 (ATCC CRL 1616), BS-C-1 (ATCCCCL 26) and MRC-5 (ATCC CCL 171).

The expression vector may be introduced into host cells via any one of anumber of techniques including but not limited to transformation,transfection, protoplast fusion, and electroporation. The expressionvector-containing cells are individually analyzed to determine whetherthey produce CDK10 protein. Identification of CDK10 expressing cells maybe done by several means, including but not limited to immunologicalreactivity with anti-CDK10 antibodies, and the presence of hostcell-associated CDK10 activity.

The cloned CDK10 cDNA obtained through the methods described above maybe recombinantly expressed by molecular cloning into an expressionvector (such as pcDNA3.1, pCR2.1, pBlueBacHis2 and pLITMUS28) containinga suitable promoter and other appropriate transcription regulatoryelements, and transferred into prokaryotic or eukaryotic host cells toproduce recombinant CDK10. Techniques for such manipulations can befound described in Sambrook, et al., supra, are discussed at length inthe Example section and are well known and easily available to theartisan of ordinary skill in the art.

Expression of CDK10 DNA may also be performed using in vitro producedsynthetic mRNA. Synthetic mRNA can be efficiently translated in variouscell-free systems, including but not limited to wheat germ extracts andreticulocyte extracts, as well as efficiently translated in cell basedsystems, including but not limited to microinjection into frog oocytes,with microinjection into frog oocytes being preferred.

To determine the CDK10 cDNA sequence(s) that yields optimal levels ofCDK10 protein, CDK10 cDNA molecules including but not limited to thefollowing can be constructed: the full-length open reading frame of theCDK10 cDNA and various constructs containing portions of the cDNAencoding only specific domains of the protein or rearranged domains ofthe protein. All constructs can be designed to contain none, all orportions of the 5' and/or 3' untranslated region of CDK10. CDK10activity and levels of protein expression can be determined followingthe introduction, both singly and in combination, of these constructsinto appropriate host cells. Following determination of the CDK10 cDNAcassette yielding optimal expression in transient assays, this CDK10cDNA construct is transferred to a variety of expression vectors(including recombinant viruses), including but not limited to those formammalian cells, plant cells, insect cells, oocytes, bacteria, and yeastcells.

Levels of CDK10 protein in host cells is quantified by a variety oftechniques including, but not limited to, immunoaffinity and/or ligandaffinity techniques. CDK10-specific affinity beads or CDK10-specificantibodies are used to isolate ³⁵ S-methionine labeled or unlabelledCDK10 protein. Labeled CDK10 protein is analyzed by SDS-PAGE. UnlabelledCDK10 protein is detected by Western blotting, ELISA or RIA assaysemploying CDK10 specific antibodies.

Following expression of CDK10 in a host cell, CDK10 protein may berecovered to provide CDK10 in active form. Several CDK10 purificationprocedures are available and suitable for use. Recombinant CDK10 may bepurified from cell lysates and extracts, or from conditioned culturemedium, by various combinations of, or individual application of saltfractionation, ion exchange chromatography, size exclusionchromatography, hydroxylapatite adsorption chromatography andhydrophobic interaction chromatography.

In addition, recombinant CDK10 can be separated from other cellularproteins by use of an immuno-affinity column made with monoclonal orpolyclonal antibodies specific for full length CDK10, or polypeptidefragments of CDK10. Additionally, polyclonal or monoclonal antibodiesmay be raised against a synthetic peptide (usually from about 9 to about25 amino acids in length) from a portion of the protein as disclosed inSEQ ID NO:3. Monospecific antibodies to CDK10 are purified frommammalian antisera containing antibodies reactive against CDK10 or areprepared as monoclonal antibodies reactive with CDK10 using thetechnique of Kohler and Milstein (1975, Nature 256:495-497).Monospecific antibody as used herein is defined as a single antibodyspecies or multiple antibody species with homogenous bindingcharacteristics for CDK10. Homogenous binding as used herein refers tothe ability of the antibody species to bind to a specific antigen orepitope, such as those associated with the CDK10, as described above.CDK10 specific antibodies are raised by immunizing animals such as mice,rats, guinea pigs, rabbits, goats, horses and the like, with anappropriate concentration of CDK10 or CDK10 synthetic peptide eitherwith or without an immune adjuvant.

Preimmune serum is collected prior to the first immunization. Eachanimal receives between about 0.1 μg and about 1000 μg of CDK10associated with an acceptable immune adjuvant. Such acceptable adjuvantsinclude, but are not limited to, Freund's complete, Freund's incomplete,alum-precipitate, water in oil emulsion containing Corynebacteriumparvum and tRNA. The initial immunization consists of the CDK10 proteinor CDK10 synthetic peptide in, preferably, Freund's complete adjuvant atmultiple sites either subcutaneously (SC), intraperitoneally (IP) orboth. Each animal is bled at regular intervals, preferably weekly, todetermine antibody titer. The animals may or may not receive boosterinjections following the initial immunizaiton. Those animals receivingbooster injections are generally given an equal amount of CDK10 inFreund's incomplete adjuvant by the same route. Booster injections aregiven at about three week intervals until maximal titers are obtained.At about 7 days after each booster immunization or about weekly after asingle immunization, the animals are bled, the serum collected, andaliquots are stored at about -20° C.

Monoclonal antibodies (mAb) reactive with CDK10 are prepared byimmunizing inbred mice, preferably Balb/c, with CDK10. The mice areimmunized by the IP or SC route with about 1 μg to about 100 μg,preferably about 10 μg, of CDK10 in about 0.5 ml buffer or salineincorporated in an equal volume of an acceptable adjuvant, as discussedabove. Freund's complete adjuvant is preferred. The mice receive aninitial immunization on day 0 and are rested for about 3 to about 30weeks. Immunized mice are given one or more booster immunizations ofabout 1 to about 100 μg of CDK10 in a buffer solution such as phosphatebuffered saline by the intravenous (IV) route. Lymphocytes, fromantibody positive mice, preferably splenic lymphocytes, are obtained byremoving spleens from immunized mice by standard procedures known in theart. Hybridoma cells are produced by mixing the splenic lymphocytes withan appropriate fusion partner, preferably myeloma cells, underconditions which will allow the formation of stable hybridomas. Fusionpartners may include, but are not limited to: mouse myelomas P3/NS1/Ag4-1; MPC-11; S-194 and Sp 2/0, with Sp 2/0 being preferred. The antibodyproducing cells and myeloma cells are fused in polyethylene glycol,about 1000 mol. wt., at concentrations from about 30% to about 50%.Fused hybridoma cells are selected by growth in hypoxanthine, thymidineand aminopterin supplemented Dulbecco's Modified Eagles Medium (DMEM) byprocedures known in the art. Supernatant fluids are collected formgrowth positive wells on about days 14, 18, and 21 and are screened forantibody production by an immunoassay such as solid phaseimmunoradioassay (SPIRA) using CDK10 as the antigen. The culture fluidsare also tested in the Ouchterlony precipitation assay to determine theisotype of the mAb. Hybridoma cells from antibody positive wells arecloned by a technique such as the soft agar technique of MacPherson,1973, Soft Agar Techniques, in Tissue Culture Methods and Applications,Kruse and Paterson, Eds., Academic Press.

Monoclonal antibodies are produced in vivo by injection of pristineprimed Balb/c mice, approximately 0.5 ml per mouse, with about 2×10⁶ toabout 6×10⁶ hybridoma cells about 4 days after priming. Ascites fluid iscollected at approximately 8-12 days after cell transfer and themonoclonal antibodies are purified by techniques known in the art.

In vitro production of anti-CDK10 mAb is carried out by growing thehydridoma in DMEM containing about 2% fetal calf serum to obtainsufficient quantities of the specific mAb. The mAb are purified bytechniques known in the art.

Antibody titers of ascites or hybridoma culture fluids are determined byvarious serological or immunological assays which include, but are notlimited to, precipitation, passive agglutination, enzyme-linkedimmunosorbent antibody (ELISA) technique and radioimmunoassay (RIA)techniques. Similar assays are used to detect the presence of CDK10 inbody fluids or tissue and cell extracts.

It is readily apparent to those skilled in the art that the abovedescribed methods for producing monospecific antibodies may be utilizedto produce antibodies specific for CDK10 polypeptide fragments, orfull-length CDK10 polypeptide.

CDK10 antibody affinity columns are made by adding the antibodies toAffigel-10 (Biorad), a gel support which is pre-activated withN-hydroxysuccinimide esters such that the antibodies form covalentlinkages with the agarose gel bead support. The antibodies are thencoupled to the gel via amide bonds with the spacer arm. The remainingactivated esters are then quenched with 1M ethanolamine HC1 (pH 8). Thecolumn is washed with water followed by 0.23 M glycine HC1 (pH 2.6) toremove any non-conjugated antibody or extraneous protein. The column isthen equilibrated in phosphate buffered saline (pH 7.3) and the cellculture supernatants or cell extracts containing CDK10 or CDK10fragments are slowly passed through the column. The column is thenwashed with phosphate buffered saline until the optical density (A₂₈₀)falls to background, then the protein is eluted with 0.23 M glycine-HCl(pH 2.6). The purified CDK10 protein is then dialyzed against phosphatebuffered saline.

The novel CDK10 of the present invention is suitable for use in an assayprocedure for the identification of compounds which modulate CDK10activity. Modulating CDK10 activity, as described herein includes theinhibition or activation of the protein and also includes directly orindirectly affecting the cell cycle regulatory properties associatedwith CDK10 activity. Compounds which modulate CDK10 activity includeagonists, antagonists, inhibitors, activators, and compounds whichdirectly or indirectly affect regulation of the CDK10 activity and/orthe CDK10/cyclin association.

The CDK10 protein kinase of the present invention may be obtained fromboth native and recombinant sources for use in an assay procedure toidentify CDK10 modulators. In general, an assay procedure to identifyCDK10 modulators will contain the CDK10-protein of the presentinvention, native cyclin protein which will form a CDK10/cyclin complex,and a test compound or sample which contains a putative CDK10 modulator.The test compounds or samples may be tested directly on, for example,purified CDK10 protein whether native or recombinant, subcellularfractions of CDK10-producing cells whether native or recombinant, and/orwhole cells expressing the CDK10 whether native or recombinant. The testcompound or sample may be added to the CDK10 in the presence or absenceof a known CDK10 modulator. The modulating activity of the test compoundor sample may be determined by, for example, analyzing the ability ofthe test compound or sample to bind to CDK10 protein, activate theprotein, inhibit CDK10 activity, inhibit or enhance the binding of othercompounds to the CDK10 protein, modifying receptor regulation, ormodifying an intracellular activity.

The identification of modulators of CDK10 activity are useful intreating disease states involving the cell cycle will be useful incontrolling cell growth associated with cancer or immune cellproliferation. Other compounds may be useful for stimulating orinhibiting activity of the enzyme. These compounds could be of use inthe treatment of diseases in which activation or inactivation of theCDK10 protein results in either cellular proliferation, cell death,nonproliferation, induction of cellular neoplastic transformations ormetastatic tumor growth and hence could be used in the prevention and/ortreatment of various cancers.

The present invention is also directed to methods for screening forcompounds which modulate the expression of DNA or RNA encoding a CDKprotein of the present invention or which modulates the function of asuch a CDK protein. Compounds which modulate these activities may beDNA, RNA, peptides, proteins, or non-proteinaceous organic molecules.Compounds may modulate by increasing or attenuating the expression ofDNA or RNA encoding the CDK protein, or the function of a CDK protein.Compounds that modulate the expression of DNA or RNA encoding the CDKprotein or the biological function thereof may be detected by a varietyof assays. The assay may be a simple "yes/no" assay to determine whetherthere is a change in expression or function. The assay may be madequantitative by comparing the expression or function of a test samplewith the levels of expression or function in a standard sample. Kitscontaining modified CDK10, antibodies to CDK10, or modified CDK10protein may be prepared by known methods for such uses.

The DNA molecules, RNA molecules, recombinant protein and antibodies ofthe present invention may be used to screen and measure levels of CDK10DNA, RNA or protein. The recombinant proteins, DNA molecules, RNAmolecules and antibodies lend themselves to the formulation of kitssuitable for the detection and typing of CDK10. Such a kit wouldcomprise a compartmentalized carrier suitable to hold in closeconfinement at least one container. The carrier would further comprisereagents such as recombinant CDK10 protein or anti-CDK10 antibodiessuitable for detecting CDK10. The carrier may also contain a means fordetection such as labeled antigen or enzyme substrates or the like.

Pharmaceutically useful compositions comprising modulators of CDK10 maybe formulated according to known methods such as by the admixture of apharmaceutically acceptable carrier. Examples of such carriers andmethods of formulation may be found in Remington's PharmaceuticalSciences. To form a pharmaceutically acceptable composition suitable foreffective administration, such compositions will contain an effectiveamount of the protein, DNA, RNA, or modified CDK10.

Therapeutic or diagnostic compositions of the invention are administeredto an individual in amounts sufficient to treat or diagnose disorders.The effective amount may vary according to a variety of factors such asthe individual's condition, weight, sex and age. Other factors includethe mode of administration.

The pharmaceutical compositions may be provided to the individual by avariety of routes such as subcutaneous, topical, oral and intramuscular.

The term "chemical derivative" describes a molecule that containsadditional chemical moieties which are not normally a part of the basemolecule. Such moieties may improve the solubility, half-life,absorption, etc. of the base molecule. Alternatively the moieties mayattenuate undesirable side effects of the base molecule or decrease thetoxicity of the base molecule. Examples of such moieties are describedin a variety of texts, such as Remington's Pharmaceutical Sciences.

Compounds identified according to the methods disclosed herein may beused alone at appropriate dosages. Alternatively, co-administration orsequential administration of other agents may be desirable.

The present invention also has the objective of providing suitabletopical, oral, systemic and parenteral pharmaceutical formulations foruse in the novel methods of treatment of the present invention. Thecompositions containing compounds identified according to this inventionas the active ingredient can be administered in a wide variety oftherapeutic dosage forms in conventional vehicles for administration.For example, the compounds can be administered in such oral dosage formsas tablets, capsules (each including timed release and sustained releaseformulations), pills, powders, granules, elixirs, tinctures, solutions,suspensions, syrups and emulsions, or by injection. Likewise, they mayalso be administered in intravenous (both bolus and infusion),intraperitoneal, subcutaneous, topical with or without occlusion, orintramuscular form, all using forms well known to those of ordinaryskill in the pharmaceutical arts.

Advantageously, compounds of the present invention may be administeredin a single daily dose, or the total daily dosage may be administered individed doses of two, three or four times daily. Furthermore, compoundsfor the present invention can be administered in intranasal form viatopical use of suitable intranasal vehicles, or via transdermal routes,using those forms of transdermal skin patches well known to those ofordinary skill in that art. To be administered in the form of atransdermal delivery system, the dosage administration will, of course,be continuous rather than intermittent throughout the dosage regimen.

For combination treatment with more than one active agent, where theactive agents are in separate dosage formulations, the active agents canbe administered concurrently, or they each can be administered atseparately staggered times.

The dosage regimen utilizing the compounds of the present invention isselected in accordance with a variety of factors including type,species, age, weight, sex and medical condition of the patient; theseverity of the condition to be treated; the route of administration;the renal and hepatic function of the patient; and the particularcompound thereof employed. A physician or veterinarian of ordinary skillcan readily determine and prescribe the effective amount of the drugrequired to prevent, counter or arrest the progress of the condition.Optimal precision in achieving concentrations of drug within the rangethat yields efficacy without toxicity requires a regimen based on thekinetics of the drug's availability to target sites. This involves aconsideration of the distribution, equilibrium, and elimination of adrug.

Isolated and purified CDK10 is also be useful for the recombinantproduction of large quantities of CDK10 protein. The ability to producelarge quantities of the protein would be useful for the production of atherapeutic agent comprising the CDK10 protein. A therapeutic agentcomprised of CDK10 protein would be useful in the treatment of cellcycle and/or CDK10 related diseases or conditions which are CDK10responsive.

By computer analysis of a genomic database, molecular cloning and DNAsequencing a novel member of the human CDK gene family has beenidentified. This new cDNA fragment encodes a novel cyclin-dependentkinase which comprises a novel cyclin binding domain signature sequence,lacks Thr14 and/or Tyr15 within the conserved ATP binding motif of knownCDKs, and also lacks the T-loop domain containing the conservedThr160/161 residue.

Northern hybridization experiments with RNA from various cell andtissues indicates that CDK10 is expressed in various human tissue,including brain, testis, pituitary gland and adrenal gland derived cellsor tissues.

The following examples are provided as illustrative of the presentinvention without, however, limiting the same thereto.

EXAMPLE 1 Isolation and Characterization of DNA Fragments Encoding CDK

A 3' portion of the CDK10 coding region was detected among theMerck-Washington University EST's as 5' EST H17727. EST "H17727"resembled several CDK anid MAPK genes. The pH17727 plasmid constructcomprising the 3' coding region and 3' untranslated region of CDK10 iscontained within a NotI-HindIII fragment of approximately 1.8 Kb, in atypical phagemid vector. The 5' portion of this fragment overlaps the 3'end of the 600 bp. This cDNA clone is publicly available by GenbankAccession No. H17727, Image Clone ID No. 50484, Washington UniversityClone ID No. ym40a06, and GBD Clone ID No. 423294. This cDNA wasisolated from a library constructed from human infant brain mRNA. Thisconstruct is available from Research Genetics, Inc., 2130 MemorialParkway SW, Hunstville, Ala. 35801 (http://www. resgen.com).

The 5' portion of the gene was isolated by performing 5' RACE (Frohman,et al., 1988, Proc. Natl. Acad. Sci. 85: 8998-9002) usingMarathon™-ready human placenta cDNA available from Clontech (Protocol#PT1156-1, Catalog #K1802-1). Adapter-ligated double stranded cDNAgenerated from human placenta mRNA was used as a template for PCRamplification using a gene specific primer PK22L234(5'-TGATGCAGCCCACAGACCTG-3'; SEQ ID NO: 4) and an adapter primer AP1(5'-CCATCCTAATACGACTCACTATAGGGC-3' SEQ ID NO:5). PCR amplification wasperformed using the Elongase™ long-PCR enzyme mix (stored in 20 mMTris-HCl (pH 8.0 at 25° C.), 0.1 mM EDTA, 1 mM DTT, stabilizers and50%(v/v) glycerol) and PCR reaction buffer obtained from Gibco-BRL. Thebuffer comprised 300 mM Tris-SO₄ (pH 9.1 at 25° C.), 90 mM (NH₄)₂ SO₄and 1.5 mM MgSO₄. Two microliters of Marathon placenta cDNA template and10 pmoles each of PK22L234 and AP1 were added to the reaction mix andbrought to a total volume of 20 ml with sterile water. Thermal cyclingwas (1) 94° C./30 sec, 68° C./6 min for 5 cycles; (2) 94° C./30 sec, 64°C./30 sec, 68° C./4 min for 5 cycles; and, (3) 94° C./30 sec, 62° C./30sec and 68° C./4 min for 30 cycles. One microliter from a 1/20 dilutionof this first PCR reaction was added to a second PCR reaction as DNAtemplate. This PCR reaction also differed from the first PCR reaction inthat nested primers PK22L161 (5'-GCCGTCTGGGGAAAAGA-3'; SEQ ID NO:6) andAP2 (5'-ACTCACTATAGGGCTCGAGCGGC-3', SEQ ID NO:7) were used. Anapproximately 600 bp PCR product was identified from a 1% agaroseelectrophoresis gel, excised, and purified using a Qiagen PCR-spuncolumn. This fragment was used directly for DNA sequencing usingPK22L161 and AP2 oligonucleotide primers.

The Marathon™-ready human placenta cDNA available from Clontech isenhanced by ligation of a double-stranded, 5' overhang adapter to thedouble stranded cDNA template. The 3' end of the adapter is blocked byan amine group to prevent extension during PCR amplification. It iswithin the non-extended 3' region that the AP1 oligo will hybridize.Therefore, AP1 does not hybridize and extend any of the original cDNAtemplate molecules, instead beginning extension and amplification in thesecond round of PCR.

EXAMPLE 2 Construction of a Full Length DNA Fragment Encoding CDK10

The 3' portion of a DNA fragment which encodes CDK10 is contained withina DNA plasmid vector, pH17727. This insert contains a 5' XhoI siteunique to the insert and a NcoI site in the 3' unstranslated regionunique to the insert. This XhoI-NcoI fragment was isolated and subclonedinto XhoI-NcoI digested pLITMUS28 plasmid DNA (New England Biolabs),resulting in pLITMUS28:H17727.

The 600 bp PCR fragments obtained from 5' RACE were cloned into pCR2.1(Invitrogen) using the Invitrogen TA-cloning kit as described by themanufacturer. A PmlI restriction site is located at approximately themidpoint of the 600 bp PCR product. The PmlI site was used to constructa wild type form of the 600 bp 5' fragment from 2 independent 5' RACEPCR clones, pPK22bo4 and pPK22do4. The PmlI-BamHI restriction fragmentof pPK22bo4 (which contains a mutation 3' to the PmlI site) was replacedwith the with the PmlI-BamHI fragment of clone pPK22do4 (which containsa mutation 5' to the PmlI site). The resulting clone, pPK22bo4/do4,overlaps the 5' portion of pH17727 through the unique XhoI restrictionsite. An SpeI-BamHI-NdeI restriction site cluster was appended just 5'to the ATG translational start codon by PCR-amplifying the insert fromclone pPK22bo4/do4 using primers PK22L661 (5'-GCCGTCTGGGGAAAAGA-3'; SEQID NO: 6) and PK22U210 (5'-GGACTAGTGGATCCATATGGACCAGTACTGCATCCT-3'; SEQID NO:10). The resulting PCR fragment was digested with Spe1 and XhoIand ligated into BamHI-XhoI digested pLITMUS28:H17727, resulting inpLITMUS28:CDK10 (FIG. 3).

EXAMPLE 3 Construction of CDK10 Mammalian Expression Vector

A BamHI-XbaI fragment from pLITMUS28:CDK10 comprising the CKD10 codingregion was subcloned into the mammalian expression vector, pcDNA3.1(Invitrogen), which was previously digested with BamHI and XbaI. Theresulting construct, pcDNA3.1:CDK10, contains a portion of the CMVpromoter and a T7 primer site upstream of the CDK10 ATG translationalstart codon as well as the BGH polyA region downstream of thetranslational termination codon. Of course, other components to allowgrowth in E. coli and mammalian cells are present in this vector.

EXAMPLE 4

Construction of CDK10 Baculovirus Transfer Vector

A BamHI-NcoI fragment from pLITMUS28:CDK10 containing the CKD10 codingregion was cloned into the baculovirus expression vector, pBlueBacHis2(Invitrogen), which was previously digested with BamHI and NcoI. Theresulting construct, pBBH:CDK10, may be used to express recombinantCDK10 from insect cells by following the manufacturer's instructions(e.g., see Invitrogen Cat. No. V375-20 for pBlueBacHis2 A, B, and C).

EXAMPLE 5 Construction of DNA Fragment Encoding a CDK10Dominant-Negative Mutant

The pLITMUS:CDK10 construct (see Example 2) was mutated to generate a"dominant-negative" single base pair mutation. This mutation wasgenerated from pLITMUS28:CDK10 using the Stratagene "Quik Change" kitand primers 22U-D127N: (5'-CAACATTGTACATCGGAACCTGAAACCTGCC-3'; SEQ IDNO:8), and 22L-D127N: (5'-GGCAGGTTTCAGGTTCCGATGTACAATGTTG-3'; SEQ ID NO:9), according to the manufacturer's instructions. The dominant-negativemutation changes the codon GAC (at nucleotides 588-590 of SEQ ID NO:2)to AAC (at nucleotides 588-590 of SEQ ID NO:11), thus deletion essentialamino acid Asp127 to Asn127 (see SEQ ID NO:12), which inactivates kinaseactivity (see Example 7 and van den Heuvel & Harlow, 1993, Science262:2050-2054). A CDK10-D127N construction was subcloned into pcDNA3.1(as a BamHI-Xba1 fragment), resulting in pcDNA3.1:CDK10-d127N. ACDK10-D127N construction was also subcloned into pBlueBacHis2 (as aBamHI-Nco1 fragment), resulting in pBBH:CDK10-d127N.

EXAMPLE 6 Tissue Distribution of CDK10 Expression

Human multiple tissue Northern Blot #7760-1, Human Brain Northern BlotII #7755-1, Human Brain Northern Blot III #7750-1, and "Human multipletissue Northern Dot Blot were purchased from Clontech. The probe wasmade by PCR amplifying the NotI-HindIll insert from pH17727 using the"Universal" (5'-CCCAGTCACGACGTTGTAAAACG-3'; SEQ ID NO:13) and "Reverse"(5'-AGCGGATAACAATTTCACACAGG-3': SEQ ID NO:14) primers from Gibco BRL.Twenty-five ng of the probe was labeled with ³² P using a Pharmacia"Ready-to-go" random priming kit and hybridized to the four Northernblots at high stringency according to Clontech instructions.

FIG. 4 and FIG. 5 show Northern data indicating the presence of CDK10transcripts in a variety of adult human tissue (FIG. 4) as well as inspecific regions of the adult and fetal human brain (FIG. 5). This datashows increased expression levels in the testis as well as in pituitaryand adrenal glands. Expression in various regions of the brain wasrelatively constant, with increased expression seen in the frontal andtemporal lobes and the cerebral cortex.

EXAMPLE 7 Effect of CDK:D127N on Cell Growth

Human osteosarcoma cell line Saos2 (ATCC HBT-85) was grown in DMEM highglucose medium+glutamine+10% fetal calf serum (in concentrations asrecommended by Gibco-BRL). Two replicates of the experiment wereperformed sequentially. Cells were split 1:6 into 10 cm culture dishestwo days prior to transfection. Transfection was performed using theCaPO4 method according to Chen and Okayama (1987, Mol. and Cell. Biol.7:2745-2752). Ten ug of each plasmid DNA (pcDNA3.1, pcDNA3:CDK10,pcDNA3:CDK10-D127N) was transfected into ˜60% confluent cells in each 10cm dish. Cells were rinsed 2× with Dulbecco's PBS (Gibco-BRL) and 10 mLfresh medium was added. After two days, cells were trypsinized andplated in 12 well dishes in fresh medium +500 ug/mL geneticin(Gibco-BRL). At 11 and 16 days after plating, colony counts were made todetermine how many transfected cells were capable of growth and colonyformation (Table 1). This data indicates that expression of the kinaseinactive "dominant-negative" form of CDK10 (i.e., CDK10-D127N) impairscolony formation by analogy to the data presented in van den Heuvel andHarlow (1993, Science 262: 2050-2054).

                  TABLE 1                                                         ______________________________________                                                   Day 11 (Colonies)                                                                             Day 16 (# Colonies)                                DNA construct                                                                              Rep A   Rep B     Rep A Rep B                                    ______________________________________                                        no DNA        2       1         0     0                                         pcDNA3.1 23 42 16 40                                                          pcDNA3.1:CDK10 32 59 23 49                                                    pcDNA3.1:CDK-D127N  5  1  2  1                                              ______________________________________                                    

EXAMPLE 8 Specific Effect of the Dominant Mutant CDK:D127N on Expressionof Cell Cycle Genes

HeLa cervical carcinoma cells were treated for 48 hours with a controladenovirus deleted for the E1 and E3 genes or the same adenovirus whichcomprised the construct encoding CDK10-D127N. Western blots wereperformed with a rabbit antibody raised to the C-terminal 25 amino acidsof the CDK10 protein (amino acid 301--amino acid 325 of SEQ ID NO:3).The cell line transfected with Ad/CDK10-D127N expressed CDK10-D127N at a50-fold higher level than endogenous, wild type CDK10. The two infectedcell populations were subjected to mRNA isolations and probes wereprepared for gene expression DNA chip studies essentially as describedby Lockhart, et al. (1996, Nature Biotechnology 14:1675-1680). Among thegenes which were suppressed at the mRNA level by CDK10-D127N aresummarized in Table 2.

                  TABLE 2                                                         ______________________________________                                                       Ad/CDK10- Ad-                                                    GENE D127N Control                                                          ______________________________________                                        CDC25b         8.3       19.1                                                   CDK7 3.1 8.5                                                                  CKS1 26.3 82.5                                                                CKS2 11.6 98.3                                                                Cyclin B 16.5 41.5                                                            Cyclin D1 4.5 11.5                                                            poly- 186.5 664.5                                                             Ubiquitin                                                                   ______________________________________                                         .sup.1 Quantified arbitrary expression units measured from the                fluorescence image of the oligonucleotide array.                         

These data indicate a cell cycle block by the dominant-negative mutantgene, CDK10-D127N, which shows the importance of the CDK10 protein tothe cell cycle. Cell cycle analysis (using a fluorescence-activated cellsorter, or FACS) of cells treated for 48 hours with the two virusesindicate that cells are not blocked in any particular phase of the cellcycle.

The data reported in the above Example sections show the importance ofCDK10 in the cell cycle. Therefore, a therapeutic agent comprising theCDK10 protein would be useful in the treatment of cell cycle and/orCDK10 related diseases or conditions which are CDK10 responsive as wellas showing a potential use for a dominant-negative mutant such asCDK10-D127N, which may be useful in the treatment of cell cycle diseasesor conditions which are responsive to the mutant proteins ability toregulate a phase or phases of the cell cycle.

    __________________________________________________________________________    #             SEQUENCE LISTING                                                   - -  - - (1) GENERAL INFORMATION:                                             - -    (iii) NUMBER OF SEQUENCES: 14                                          - -  - - (2) INFORMATION FOR SEQ ID NO:1:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 7 amino - #acids                                                  (B) TYPE: amino acid                                                          (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: peptide                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                               - -      Pro Asn Gln Ala Leu Arg Glu                                              1             - #  5                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:2:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 2074 base - #pairs                                                (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: double                                                      (D) TOPOLOGY: both                                                   - -     (ii) MOLECULE TYPE: cDNA                                              - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                               - - GAAAAGGCGC AGTGGGGCCC GGAGCTGTCA CCCCTGACTC GACGCAGCTT CC -            #GTTCTCCT     60                                                                 - - GGTGACGTCG CCTACAGGAA CCGCCCCAGT GGTCAGCTGC CGCGCTGTTG CT -            #AGGCAACA    120                                                                 - - GCGTGCGAGC TCAGATCAGC GTGGGGTGGA GGAGAAGTGG AGTTTGGAAG TT -            #CAGGGGCA    180                                                                 - - CAGGGGCACA GGCCCACGAC TGCAGCGGGA TGGACCAGTA CTGCATCCTG GG -            #CCGCATCG    240                                                                 - - GGGAGGGCGC CCACGGCATC GTCTTCAAGG CCAAGCACGT GGAGACTGGC GA -            #GATAGTTG    300                                                                 - - CCCTCAAGAA GGTGGCCCTA AGGCGGTTGG AAGACGGCTT CCCTAACCAG GC -            #CCTGCGGG    360                                                                 - - AGATTAAGGC TCTGCAGGAG ATGGAGGACA ATCAGTATGT GGTACAACTG AA -            #GGCTGTGT    420                                                                 - - TCCCACACGG TGGAGGCTTT GTGCTGGCCT TTGAGTTCAT GCTGTCGGAT CT -            #GGCCGAGG    480                                                                 - - TGGTGCGCCA TGCCCAGAGG CCACTAGCCC AGGCACAGGT CAAGAGCTAC CT -            #GCAGATGC    540                                                                 - - TGCTCAAGGG TGTCGCCTTC TGCCATGCCA ACAACATTGT ACATCGGGAC CT -            #GAAACCTG    600                                                                 - - CCAACCTGCT CATCAGCGCC TCAGGCCAGC TCAAGATAGC GGACTTTGGC CT -            #GGCTCGAG    660                                                                 - - TCTTTTCCCC AGACGGCAGC CGCCTCTACA CACACCAGGT GGCCACCAGG TC -            #TGTGGGCT    720                                                                 - - GCATCATGGG GGAGCTGTTG AATGGGTCCC CCCTTTTCCC GGGCAAGAAC GA -            #TATTGAAC    780                                                                 - - AGCTTTGCTA TGTGCTTCGC ATCTTGGGCA CCCCAAACCC TCAAGTCTGG CC -            #GGAGCTCA    840                                                                 - - CTGAGCTGCC GGACTACAAC AAGATCTCCT TTAAGGAGCA GGTGCCCATG CC -            #CCTGGAGG    900                                                                 - - AGGTGCTGCC TGACGTCTCT CCCCAGGCAT TGGATCTGCT GGGTCAATTC CT -            #TCTCTACC    960                                                                 - - CTCCTCACCA GCGCATCGCA GCTTCCAAGG CTCTCCTCCA TCAGTACTTC TT -            #CACAGCTC   1020                                                                 - - CCCTGCCTGC CCATCCATCT GAGCTGCCGA TTCCTCAGCG TCTAGGGGGA CC -            #TGCCCCCA   1080                                                                 - - AGGCCCATCC AGGGCCCCCC CACATCCATG ACTTCCACGT GGACCGGCCT CT -            #TGAGGAGT   1140                                                                 - - CGCTGTTGAA CCCAGAGCTG ATTCGGCCCT TCATCCTGGA GGGGTGAGAA GT -            #TGGCCCTG   1200                                                                 - - GTCCCGTCTG CCTGCTCCTC AGGACCACTC AGTCCACCTG TTCCTCTGCC AC -            #CTGCCTGG   1260                                                                 - - CTTCACCCTC CAAGGCCTCC CCATGGCCAC AGTGGGCCCA CACCACACCC TG -            #CCCCTTAG   1320                                                                 - - CCCTTGCGAG GGTTGGTCTC GAGGCAGAGG TCATGTTCCC AGCCAAGAGT AT -            #GAGAACAT   1380                                                                 - - CCAGTCGAGC AGAGGAGATT CATGGCCTGT GCTCGGTGAG CCTTACCTTC TG -            #TGTGCTAC   1440                                                                 - - TGACGTACCC ATCAGGACAG TGAGCTCTGC TGCCAGTCAA GGCCTGCATA TG -            #CAGAATGA   1500                                                                 - - CGATGCCTGC CTTGGTGCTG CTTCCCCGAG TGCTGCCTCC TGGTCAAGGA GA -            #AGTGCAGA   1560                                                                 - - GAGTAAGGTG TCCTTATGTT GGAAACTCAA GTGGAAGGAA GATTTGGTTT GG -            #TTTTATTC   1620                                                                 - - TCAGAGCCAT TAAACACTAG TTCAGTATGT GAGATATAGA TTCTAAAAAC CT -            #CAGGTGGC   1680                                                                 - - TCTGCCTTAT GTCTGTTCCT CCTTCATTTC TCTCAAGGGA AATGGCTAAG GT -            #GGCATTGT   1740                                                                 - - CTCATGGCTC TCGTTTTTGG GGTCATGGGG AGGGTAGCAC CAGGCATAGC CA -            #CTTTTGCC   1800                                                                 - - CTGAGGGACT CCTGTGTGCT TCACATCACT GAGCACTCAT TTAGAAGTGA GG -            #GAGACAGA   1860                                                                 - - AGTCTAGGCC CAGGGATGGC TCCAGTTGGG GATCCAGCAG GAGACCCTCT GC -            #ACATGAGG   1920                                                                 - - CTGGTTTACC AACATCTACT CCCTCAGGAT GAGCGTGAGC CAGAAGCAGC TG -            #TGTATTTA   1980                                                                 - - AGGAAACAAG CGTTCCTGGA ATTAATTTAT AAATTTAATA AATCCCAATA TA -            #ATCCCAAA   2040                                                                 - - AAAAAAAAAA AAAAAATTCC TGCGGCCGCA AGGA       - #                  -     #      2074                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:3:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 325 amino - #acids                                                (B) TYPE: amino acid                                                          (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: protein                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                               - -      Met Asp Gln Tyr Cys Ile Leu Gly - # Arg Ile Gly Glu Gly Ala        His Gly                                                                              1             - #  5                - #   10               - #         15                                                                               - -      Ile Val Phe Lys Ala Lys His Val - # Glu Thr Gly Glu Ile Val       Ala Leu                                                                                          20 - #                 25 - #                 30             - -      Lys Lys Val Ala Leu Arg Arg Leu - # Glu Asp Gly Phe Pro Asn        Gln Ala                                                                                      35     - #             40     - #             45                  - -      Leu Arg Glu Ile Lys Ala Leu Gln - # Glu Met Glu Asp Asn Gln       Tyr Val                                                                                  50         - #         55         - #         60                      - -      Val Gln Leu Lys Ala Val Phe Pro - # His Gly Gly Gly Phe Val       Leu Ala                                                                              65             - #     70             - #     75             - #         80                                                                            - -      Phe Glu Phe Met Leu Ser Asp Leu - # Ala Glu Val Val Arg His        Ala Gln                                                                                           - #   85               - #   90               - #         95                                                                               - -      Arg Pro Leu Ala Gln Ala Gln Val - # Lys Ser Tyr Leu Gln Met       Leu Leu                                                                                          100 - #                105 - #                110            - -      Lys Gly Val Ala Phe Cys His Ala - # Asn Asn Ile Val His Arg        Asp Leu                                                                                      115     - #            120     - #            125                 - -      Lys Pro Ala Asn Leu Leu Ile Ser - # Ala Ser Gly Gln Leu Lys       Ile Ala                                                                                  130         - #        135         - #        140                     - -      Asp Phe Gly Leu Ala Arg Val Phe - # Ser Pro Asp Gly Ser Arg       Leu Tyr                                                                              145             - #    150             - #    155             - #        160                                                                           - -      Thr His Gln Val Ala Thr Arg Ser - # Val Gly Cys Ile Met Gly        Glu Leu                                                                                           - #   165              - #   170              - #         175                                                                              - -      Leu Asn Gly Ser Pro Leu Phe Pro - # Gly Lys Asn Asp Ile Glu       Gln Leu                                                                                          180 - #                185 - #                190            - -      Cys Tyr Val Leu Arg Ile Leu Gly - # Thr Pro Asn Pro Gln Val        Trp Pro                                                                                      195     - #            200     - #            205                 - -      Glu Leu Thr Glu Leu Pro Asp Tyr - # Asn Lys Ile Ser Phe Lys       Glu Gln                                                                                  210         - #        215         - #        220                     - -      Val Pro Met Pro Leu Glu Glu Val - # Leu Pro Asp Val Ser Pro       Gln Ala                                                                              225             - #    230             - #    235             - #        240                                                                           - -      Leu Asp Leu Leu Gly Gln Phe Leu - # Leu Tyr Pro Pro His Gln        Arg Ile                                                                                           - #   245              - #   250              - #         255                                                                              - -      Ala Ala Ser Lys Ala Leu Leu His - # Gln Tyr Phe Phe Thr Ala       Pro Leu                                                                                          260 - #                265 - #                270            - -      Pro Ala His Pro Ser Glu Leu Pro - # Ile Pro Gln Arg Leu Gly        Gly Pro                                                                                      275     - #            280     - #            285                 - -      Ala Pro Lys Ala His Pro Gly Pro - # Pro His Ile His Asp Phe       His Val                                                                                  290         - #        295         - #        300                     - -      Asp Arg Pro Leu Glu Glu Ser Leu - # Leu Asn Pro Glu Leu Ile       Arg Pro                                                                              305             - #    310             - #    315             - #        320                                                                           - -      Phe Ile Leu Glu Gly                                                                   - #   325                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:4:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 20 base - #pairs                                                  (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: other nucleic acid                                         (A) DESCRIPTION: /desc - #= "oligonucleotide"                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                               - - TGATGCAGCC CACAGACCTG            - #                  - #                      - # 20                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:5:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 27 base - #pairs                                                  (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: other nucleic acid                                         (A) DESCRIPTION: /desc - #= "oligonucleotide"                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                               - - CCATCCTAAT ACGACTCACT ATAGGGC          - #                  - #                 27                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:6:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 17 base - #pairs                                                  (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: other nucleic acid                                         (A) DESCRIPTION: /desc - #= "oligonucleotide"                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                               - - GCCGTCTGGG GAAAAGA             - #                  - #                      - #   17                                                                   - -  - - (2) INFORMATION FOR SEQ ID NO:7:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 23 base - #pairs                                                  (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: other nucleic acid                                         (A) DESCRIPTION: /desc - #= "oligonucleotide"                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                               - - ACTCACTATA GGGCTCGAGC GGC           - #                  - #                    23                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:8:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 31 base - #pairs                                                  (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: other nucleic acid                                         (A) DESCRIPTION: /desc - #= "oligonucleotide"                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                               - - CAACATTGTA CATCGGAACC TGAAACCTGC C        - #                  - #              31                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:9:                                     - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 31 base - #pairs                                                  (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: other nucleic acid                                         (A) DESCRIPTION: /desc - #= "oligonucleotide"                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                               - - GGCAGGTTTC AGGTTCCGAT GTACAATGTT G        - #                  - #              31                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:10:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 36 base - #pairs                                                  (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: other nucleic acid                                         (A) DESCRIPTION: /desc - #= "oligonucleotide"                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                              - - GGACTAGTGG ATCCATATGG ACCAGTACTG CATCCT      - #                  -     #       36                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:11:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 2074 base - #pairs                                                (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: both                                                   - -     (ii) MOLECULE TYPE: cDNA                                              - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                              - - GAAAAGGCGC AGTGGGGCCC GGAGCTGTCA CCCCTGACTC GACGCAGCTT CC -             #GTTCTCCT     60                                                                 - - GGTGACGTCG CCTACAGGAA CCGCCCCAGT GGTCAGCTGC CGCGCTGTTG CT -            #AGGCAACA    120                                                                 - - GCGTGCGAGC TCAGATCAGC GTGGGGTGGA GGAGAAGTGG AGTTTGGAAG TT -            #CAGGGGCA    180                                                                 - - CAGGGGCACA GGCCCACGAC TGCAGCGGGA TGGACCAGTA CTGCATCCTG GG -            #CCGCATCG    240                                                                 - - GGGAGGGCGC CCACGGCATC GTCTTCAAGG CCAAGCACGT GGAGACTGGC GA -            #GATAGTTG    300                                                                 - - CCCTCAAGAA GGTGGCCCTA AGGCGGTTGG AAGACGGCTT CCCTAACCAG GC -            #CCTGCGGG    360                                                                 - - AGATTAAGGC TCTGCAGGAG ATGGAGGACA ATCAGTATGT GGTACAACTG AA -            #GGCTGTGT    420                                                                 - - TCCCACACGG TGGAGGCTTT GTGCTGGCCT TTGAGTTCAT GCTGTCGGAT CT -            #GGCCGAGG    480                                                                 - - TGGTGCGCCA TGCCCAGAGG CCACTAGCCC AGGCACAGGT CAAGAGCTAC CT -            #GCAGATGC    540                                                                 - - TGCTCAAGGG TGTCGCCTTC TGCCATGCCA ACAACATTGT ACATCGGAAC CT -            #GAAACCTG    600                                                                 - - CCAACCTGCT CATCAGCGCC TCAGGCCAGC TCAAGATAGC GGACTTTGGC CT -            #GGCTCGAG    660                                                                 - - TCTTTTCCCC AGACGGCAGC CGCCTCTACA CACACCAGGT GGCCACCAGG TC -            #TGTGGGCT    720                                                                 - - GCATCATGGG GGAGCTGTTG AATGGGTCCC CCCTTTTCCC GGGCAAGAAC GA -            #TATTGAAC    780                                                                 - - AGCTTTGCTA TGTGCTTCGC ATCTTGGGCA CCCCAAACCC TCAAGTCTGG CC -            #GGAGCTCA    840                                                                 - - CTGAGCTGCC GGACTACAAC AAGATCTCCT TTAAGGAGCA GGTGCCCATG CC -            #CCTGGAGG    900                                                                 - - AGGTGCTGCC TGACGTCTCT CCCCAGGCAT TGGATCTGCT GGGTCAATTC CT -            #TCTCTACC    960                                                                 - - CTCCTCACCA GCGCATCGCA GCTTCCAAGG CTCTCCTCCA TCAGTACTTC TT -            #CACAGCTC   1020                                                                 - - CCCTGCCTGC CCATCCATCT GAGCTGCCGA TTCCTCAGCG TCTAGGGGGA CC -            #TGCCCCCA   1080                                                                 - - AGGCCCATCC AGGGCCCCCC CACATCCATG ACTTCCACGT GGACCGGCCT CT -            #TGAGGAGT   1140                                                                 - - CGCTGTTGAA CCCAGAGCTG ATTCGGCCCT TCATCCTGGA GGGGTGAGAA GT -            #TGGCCCTG   1200                                                                 - - GTCCCGTCTG CCTGCTCCTC AGGACCACTC AGTCCACCTG TTCCTCTGCC AC -            #CTGCCTGG   1260                                                                 - - CTTCACCCTC CAAGGCCTCC CCATGGCCAC AGTGGGCCCA CACCACACCC TG -            #CCCCTTAG   1320                                                                 - - CCCTTGCGAG GGTTGGTCTC GAGGCAGAGG TCATGTTCCC AGCCAAGAGT AT -            #GAGAACAT   1380                                                                 - - CCAGTCGAGC AGAGGAGATT CATGGCCTGT GCTCGGTGAG CCTTACCTTC TG -            #TGTGCTAC   1440                                                                 - - TGACGTACCC ATCAGGACAG TGAGCTCTGC TGCCAGTCAA GGCCTGCATA TG -            #CAGAATGA   1500                                                                 - - CGATGCCTGC CTTGGTGCTG CTTCCCCGAG TGCTGCCTCC TGGTCAAGGA GA -            #AGTGCAGA   1560                                                                 - - GAGTAAGGTG TCCTTATGTT GGAAACTCAA GTGGAAGGAA GATTTGGTTT GG -            #TTTTATTC   1620                                                                 - - TCAGAGCCAT TAAACACTAG TTCAGTATGT GAGATATAGA TTCTAAAAAC CT -            #CAGGTGGC   1680                                                                 - - TCTGCCTTAT GTCTGTTCCT CCTTCATTTC TCTCAAGGGA AATGGCTAAG GT -            #GGCATTGT   1740                                                                 - - CTCATGGCTC TCGTTTTTGG GGTCATGGGG AGGGTAGCAC CAGGCATAGC CA -            #CTTTTGCC   1800                                                                 - - CTGAGGGACT CCTGTGTGCT TCACATCACT GAGCACTCAT TTAGAAGTGA GG -            #GAGACAGA   1860                                                                 - - AGTCTAGGCC CAGGGATGGC TCCAGTTGGG GATCCAGCAG GAGACCCTCT GC -            #ACATGAGG   1920                                                                 - - CTGGTTTACC AACATCTACT CCCTCAGGAT GAGCGTGAGC CAGAAGCAGC TG -            #TGTATTTA   1980                                                                 - - AGGAAACAAG CGTTCCTGGA ATTAATTTAT AAATTTAATA AATCCCAATA TA -            #ATCCCAAA   2040                                                                 - - AAAAAAAAAA AAAAAATTCC TGCGGCCGCA AGGA       - #                  -     #      2074                                                                     - -  - - (2) INFORMATION FOR SEQ ID NO:12:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 325 amino - #acids                                                (B) TYPE: amino acid                                                          (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: protein                                           - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                              - -      Met Asp Gln Tyr Cys Ile Leu Gly - # Arg Ile Gly Glu Gly Ala        His Gly                                                                              1             - #  5                - #   10               - #         15                                                                               - -      Ile Val Phe Lys Ala Lys His Val - # Glu Thr Gly Glu Ile Val       Ala Leu                                                                                          20 - #                 25 - #                 30             - -      Lys Lys Val Ala Leu Arg Arg Leu - # Glu Asp Gly Phe Pro Asn        Gln Ala                                                                                      35     - #             40     - #             45                  - -      Leu Arg Glu Ile Lys Ala Leu Gln - # Glu Met Glu Asp Asn Gln       Tyr Val                                                                                  50         - #         55         - #         60                      - -      Val Gln Leu Lys Ala Val Phe Pro - # His Gly Gly Gly Phe Val       Leu Ala                                                                              65             - #     70             - #     75             - #         80                                                                            - -      Phe Glu Phe Met Leu Ser Asp Leu - # Ala Glu Val Val Arg His        Ala Gln                                                                                           - #   85               - #   90               - #         95                                                                               - -      Arg Pro Leu Ala Gln Ala Gln Val - # Lys Ser Tyr Leu Gln Met       Leu Leu                                                                                          100 - #                105 - #                110            - -      Lys Gly Val Ala Phe Cys His Ala - # Asn Asn Ile Val His Arg        Asn Leu                                                                                      115     - #            120     - #            125                 - -      Lys Pro Ala Asn Leu Leu Ile Ser - # Ala Ser Gly Gln Leu Lys       Ile Ala                                                                                  130         - #        135         - #        140                     - -      Asp Phe Gly Leu Ala Arg Val Phe - # Ser Pro Asp Gly Ser Arg       Leu Tyr                                                                              145             - #    150             - #    155             - #        160                                                                           - -      Thr His Gln Val Ala Thr Arg Ser - # Val Gly Cys Ile Met Gly        Glu Leu                                                                                           - #   165              - #   170              - #         175                                                                              - -      Leu Asn Gly Ser Pro Leu Phe Pro - # Gly Lys Asn Asp Ile Glu       Gln Leu                                                                                          180 - #                185 - #                190            - -      Cys Tyr Val Leu Arg Ile Leu Gly - # Thr Pro Asn Pro Gln Val        Trp Pro                                                                                      195     - #            200     - #            205                 - -      Glu Leu Thr Glu Leu Pro Asp Tyr - # Asn Lys Ile Ser Phe Lys       Glu Gln                                                                                  210         - #        215         - #        220                     - -      Val Pro Met Pro Leu Glu Glu Val - # Leu Pro Asp Val Ser Pro       Gln Ala                                                                              225             - #    230             - #    235             - #        240                                                                           - -      Leu Asp Leu Leu Gly Gln Phe Leu - # Leu Tyr Pro Pro His Gln        Arg Ile                                                                                           - #   245              - #   250              - #         255                                                                              - -      Ala Ala Ser Lys Ala Leu Leu His - # Gln Tyr Phe Phe Thr Ala       Pro Leu                                                                                          260 - #                265 - #                270            - -      Pro Ala His Pro Ser Glu Leu Pro - # Ile Pro Gln Arg Leu Gly        Gly Pro                                                                                      275     - #            280     - #            285                 - -      Ala Pro Lys Ala His Pro Gly Pro - # Pro His Ile His Asp Phe       His Val                                                                                  290         - #        295         - #        300                     - -      Asp Arg Pro Leu Glu Glu Ser Leu - # Leu Asn Pro Glu Leu Ile       Arg Pro                                                                              305             - #    310             - #    315             - #        320                                                                           - -      Phe Ile Leu Glu Gly                                                                   - #   325                                                    - -  - - (2) INFORMATION FOR SEQ ID NO:13:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 23 base - #pairs                                                  (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: other nucleic acid                                         (A) DESCRIPTION: /desc - #= "oligonucleotide"                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                              - - CCCAGTCACG ACGTTGTAAA ACG           - #                  - #                    23                                                                      - -  - - (2) INFORMATION FOR SEQ ID NO:14:                                    - -      (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 23 base - #pairs                                                  (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                 - -     (ii) MOLECULE TYPE: other nucleic acid                                         (A) DESCRIPTION: /desc - #= "Oligonucleotide"                        - -     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                              - - AGCGGATAAC AATTTCACAC AGG           - #                  - #                    23                                                                    __________________________________________________________________________

What is claimed is:
 1. A purified DNA molecule encoding a human cyclindependent kinase wherein said DNA molecule encodes a protein comprisingthe amino acid sequence as follows:

    Met Asp Gln Tyr Cys Ile Leu Gly Arg Ile Gly Glu                                 - Gly Ala His Gly Ile Val Phe Lys Ala Lys His Val                             - Glu Thr Gly Glu Ile Val Ala Leu Lys Lys Val Ala                             - Leu Arg Arg Leu Glu Asp Gly Phe Pro Asn Gln Ala                             - Leu Arg Glu Ile Lys Ala Leu Gln Glu Met Glu Asp                             - Asn Gln Tyr Val Val Gln Leu Lys Ala Val Phe Pro                             - His Gly Gly Gly Phe Val Leu Ala Phe Glu Phe Met                             - Leu Ser Asp Leu Ala Glu Val Val Arg His Ala Gln                             - Arg Pro Leu Ala Gln Ala Gln Val Lys Ser Tyr Leu                             - Gln Met Leu Leu Lys Gly Val Ala Phe Cys His Ala                             - Asn Asn Ile Val His Arg Asp Leu Lys Pro Ala Asn                             - Leu Leu Ile Ser Ala Ser Gly Gln Leu Lys Ile Ala                             - Asp Phe Gly Leu Ala Arg Val Phe Ser Pro Asp Gly                             - Ser Arg Leu Tyr Thr His Gln Val Ala Thr Arg Ser                             - Val Gly Cys Ile Met Gly Glu Leu Leu Asn Gly Ser                             - Pro Leu Phe Pro Gly Lys Asn Asp Ile Glu Gln Leu                             - Cys Tyr Val Leu Arg Ile Leu Gly Thr Pro Asn Pro                             - Gln Val Trp Pro Glu Leu Thr Glu Leu Pro Asp Tyr                             - Asn Lys Ile Ser Phe Lys Glu Gln Val Pro Met Pro                             - Leu Glu Glu Val Leu Pro Asp Val Ser Pro Gln Ala                             - Leu Asp Leu Leu Gly Gln Phe Leu Leu Tyr Pro Pro                             - His Gln Arg Ile Ala Ala Ser Lys Ala Leu Leu His                             - Gln Tyr Phe Phe Thr Ala Pro Leu Pro Ala His Pro                             - Ser Glu Leu Pro Ile Pro Gln Arg Leu Gly Gly Pro                             - Ala Pro Lys Ala His Pro Gly Pro Pro His Ile His                             - Asp Phe His Val Asp Arg Pro Leu Glu Glu Ser Leu                             - Leu Asn Pro Glu Leu Ile Arg Pro Phe Ile Leu Glu                             - Gly,                                                                  

set forth as SEQ ID NO:3.
 2. A purified DNA molecule encoding a humancyclic dependent kinase comprising which comprises the nucleotidesequence as follows:

                           GAAAAGGCGC AGTGGGGCCC GGAGCTGTCA CCCCTGACTC                                   GACGCAGCTT CCGTTCTCCT                                    GGTGACGTCG CCTACAGGAA CCGCCCCAGT GGTCAGCTGC CGCGCTGTTG CTAGGCAACA                                   GCGTGCGAGC TCAGATCAGC GTGGGGTGGA GGAGAAGTGG                                  AGTTTGGAAG TTCAGGGGCA                                    CAGGGGCACA GGCCCACGAC TGCAGCGGGA TGGACCAGTA CTGCATCCTG GGCCGCATCG                                   GGGAGGGCGC CCACGGCATC GTCTTCAAGG CCAAGCACGT                                  GGAGACTGGC GAGATAGTTG                                    CCCTCAAGAA GGTGGCCCTA AGGCGGTTGG AAGACGGCTT CCCTAACCAG GCCCTGCGGG                                   AGATTAAGGC TCTGCAGGAG ATGGAGGACA ATCAGTATGT                                  GGTACAACTG AAGGCTGTGT                                    TCCCACACGG TGGAGGCTTT GTGCTGGCCT TTGAGTTCAT GCTGTCGGAT CTGGCCGAGG                                   TGGTGCGCCA TGCCCAGAGG CCACTAGCCC AGGCACAGGT                                  CAAGAGCTAC CTGCAGATGC                                    TGCTCAAGGG TGTCGCCTTC TGCCATGCCA ACAACATTGT ACATCGGGAC CTGAAACCTG                                   CCAACCTGCT CATCAGCGCC TCAGGCCAGC TCAAGATAGC                                  GGACTTTGGC CTGGCTCGAG                                    TCTTTTCCCC AGACGGCAGC CGCCTCTACA CACACCAGGT GGCCACCAGG TCTGTGGGCT                                   GCATCATGGG GGAGCTGTTG AATGGGTCCC CCCTTTTCCC                                  GGGCAAGAAC GATATTGAAC                                    AGCTTTGCTA TGTGCTTCGC ATCTTGGGCA CCCCAAACCC TCAAGTCTGG CCGGAGCTCA                                   CTGAGCTGCC GGACTACAAC AAGATCTCCT TTAAGGAGCA                                  GGTGCCCATG CCCCTGGAGG                                    AGGTGCTGCC TGACGTCTCT CCCCAGGCAT TGGATCTGCT GGGTCAATTC CTTCTCTACC                                   CTCCTCACCA GCGCATCGCA GCTTCCAAGG CTCTCCTCCA                                  TCAGTACTTC TTCACAGCTC                                    CCCTGCCTGC CCATCCATCT GAGCTGCCGA TTCCTCAGCG TCTAGGGGGA CCTGCCCCCA                                   AGGCCCATCC AGGGCCCCCC CACATCCATG ACTTCCACGT                                  GGACCGGCCT CTTGAGGAGT                                    CGCTGTTGAA CCCAGAGCTG ATTCGGCCCT TCATCCTGGA GGGGTGAGAA GTTGGCCCTG                                   GTCCCGTCTG CCTGCTCCTC AGGACCACTC AGTCCACCTG                                  TTCCTCTGCC ACCTGCCTGG                                    CTTCACCCTC CAAGGCCTCC CCATGGCCAC AGTGGGCCCA CACCACACCC TGCCCCTTAG                                   CCCTTGCGAG GGTTGGTCTC GAGGCAGAGG TCATGTTCCC                                  AGCCAAGAGT ATGAGAACAT                                    CCAGTCGAGC AGAGGAGATT CATGGCCTGT GCTCGGTGAG CCTTACCTTC TGTGTGCTAC                                   TGACGTACCC ATCAGGACAG TGAGCTCTGC TGCCAGTCAA                                  GGCCTGCATA TGCAGAATGA                                    CGATGCCTGC CTTGGTGCTG CTTCCCCGAG TGCTGCCTCC TGGTCAAGGA GAAGTGCAGA                                   GAGTAAGGTG TCCTTATGTT GGAAACTCAA GTGGAAGGAA                                  GATTTGGTTT GGTTTTATTC                                    TCAGAGCCAT TAAACACTAG TTCAGTATGT GAGATATAGA TTCTAAAAAC CTCAGGTCGC                                   TCTGCCTTAT GTCTGTTCCT CCTTCATTTC GAGATATAGA                                  TCTCAAGGGA AATGGCTAAG                                    CTCATGGCTC TCGTTTTTGG GGTCATGGGG AGGGTAGCAC CAGGCATAGC CACTTTTGCC                                   CTGAGGGACT CCTGTGTGCT TCACATCACT GAGCACTCAT                                  TTAGAAGTGA GGGAGACAGA                                    AGTCTAGGCC CAGGGATGGC TCCAGTTGCG GATCCAGCAG GAGACCCTCT GCACATGAGG                                   CTGGTTTACC AACATCTACT CCCTCAGGAT GAGCGTGAGC                                  CAGAAGCAGC TGTGTATTTA                                    AGGAAACAAG CGTTCCTGGA ATTAATTTAT AAATTTAATA AATCCCAATA                        AAAAAAAAAA AAAAAATTCC TGCGGCCGCA AGGA,                                  

set forth as SEQ ID NO:2.
 3. A DNA molecule of claim 2 which comprisesnucleotide 210 to nucleotide 1185 of SEQ ID NO:2.
 4. A DNA molecule ofclaim 2 consisting of nucleotide 210 to nucleotide 1185 of seq ID No: 2.5. An expression vector for the expression of a human cyclin dependentkinase in a recombinant host cell wherein said expression vectorcomprises the DNA molecule of claim
 2. 6. The expression vector of claim5 which is selected from the group consisting of pLITMUS28:CDK10,pcDNA3.1:CDK10, and pBBH:CDK10.
 7. A host cell which expresses arecombinant human cyclin dependent kinase wherein said host cellcontains the expression vector of claim
 5. 8. A host cell whichexpresses a recombinant human cyclin dependent kinase wherein said hostcell contains the expression vector of claim
 6. 9. A process for theexpression of a human cyclin dependent kinase protein in a recombinanthost cell, comprising:(a) transfecting the expression vector of claim 3into a suitable host cell; and, (b) culturing the host cells of step (a)under conditions which allow expression of the human cyclin dependentkinase protein from the expression vector.
 10. A purified DNA moleculeencoding a mutant of human cyclic dependent kinase with decreased kinaseactivity, said DNA molecule comprising which comprises the nucleotidesequence as follows:

    GAAAAGGCGC AGTGGGGCCC GGAGCTGTCA CCCCTGACTC GACGCAGCTT CCGTTCTCCT                                                      GGTGACGTCG                                                                    CCTACAGGAA CCGCCCCAGT GGTCAGCTGC                                             CGCGCTGTTG CTAGGCAACA                   GCGTGCGAGC TCAGATCAGC GTGGGGTGGA GGAGAAGTGG AGTTTGGAAG TTCAGGGGCA                                                    CAGGGGCACA GGCCCACGAC TGCAGCGGGA                                             TGGACCAGTA CTGCATCCTG GGCCGCATCG                                               GGGAGGGCGC CCACGGCATC GTCTTCAAGG                                             CCAAGCACGT GGAGACTGGC GAGATAGTTG                                               CCCTCAAGAA GGTGGCCCTA AGGCGGTTGG                                             AAGACGGCTT CCCTAACCAG GCCCTGCGGG                                               AGATTAAGGC TCTGCAGGAG ATGGAGGACA                                             ATCAGTATGT GGTACAACTG AAGGCTGTGT                                               TCCCACACGG TGGAGGCTTT GTGCTGGCCT                                             TTGAGTTCAT GCTGTCGGAT CTGGCCGAGG                                               TGGTGCGCCA TGCCCAGAGG CCACTAGCCC                                             AGGCACAGGT CAAGAGCTAC CTGCAGATGC                                               TGCTCAAGGG TGTCGCCTTC TGCCATGCCA                                             ACAACATTGT ACATCGGAAC CTGAAACCTG                                               CCAACCTGCT CATCAGCGCC TCAGGCCAGC                                             TCAAGATAGC GGACTTTGGC CTGGCTCGAG                                               TCTTTTCCCC AGACGGCAGC CGCCTCTACA                                             CACACCAGGT GGCCACCAGG TCTGTGGGCT                                               GCATCATGGG GGAGCTGTTG AATGGGTCCC                                             CCCTTTTCCC GGGCAAGAAC GATATTGAAC                                               AGCTTTGCTA TGTGCTTCGC ATCTTGGGCA                                             CCCCAAACCC TCAAGTCTGG CCGGAGCTCA                                               CTGAGCTGCC GGACTACAAC AAGATCTCCT                                             TTAAGGAGCA GGTGCCCATG CCCCTGGAGG                                               AGGTGCTGCC TGACGTCTCT CCCCAGGCAT                                             TGGATCTGCT GGGTCAATTC CTTCTCTACC                                               CTCCTCACCA GCGCATCGCA GCTTCCAAGG                                             CTCTCCTCCA TCAGTACTTC TTCACAGCTC                                               CCCTGCCTGC CCATCCATCT GAGCTGCCGA                                             TTCCTCAGCG TCTAGGGGGA CCTGCCCCCA                                               AGGCCCATCC AGGGCCCCCC CACATCCATG                                             ACTTCCACGT GGACCGGCCT CTTGAGGAGT                                               CGCTGTTGAA CCCAGAGCTG ATTCGGCCCT                                             TCATCCTGGA GGGGTGAGAA GTTGGCCCTG                                               GTCCCGTCTG CCTGCTCCTC AGGACCACTC                                             AGTCCACCTG TTCCTCTGCC ACCTGCCTGG                                               CTTCACCCTC CAAGGCCTCC CCATGGCCAC                                             AGTGGGCCCA CACCACACCC TGCCCCTTAG                                               CCCTTGCGAG GGTTGGTCTC GAGGCAGAGG                                             TCATGTTCCC AGCCAAGAGT ATGAGAACAT                                               CCAGTCGAGC AGAGGAGATT CATGGCCTGT                                             GCTCGGTGAG CCTTACCTTC TGTGTGCTAC                                               TGACGTACCC ATCAGGACAG TGAGCTCTGC                                             TGCCAGTCAA GGCCTGCATA TGCAGAATGA                                               CGATGCCTGC CTTGGTGCTG CTTCCCCGAG                                             TGCTGCCTCC TGGTCAAGGA GAAGTGCAGA                                               GAGTAAGGTG TCCTTATGTT GGAAACTCAA                                             GTGGAAGGAA GATTTGGTTT GGTTTTATTC                                               TCAGAGCCAT TAAACACTAG TTCAGTATGT                                             GAGATATAGA TTCTAAAAAC CTCAGGTGGC                                               TCTGCCTTAT GTCTGTTCCT CCTTCATTTC                                             TCTCAAGGGA AATGGCTAAG GTGGCATTGT                                               CTCATGGCTC TCGTTTTTGG GGTCATGGGG                                             AGGGTAGCAC CAGGCATAGC CACTTTTGCC                                               CTGAGGGACT CCTGTGTGCT TCACATCACT                                             GAGCACTCAT TTAGAAGTGA GGGAGACAGA                                               AGTCTAGGCC CAGGGATGGC TCCAGTTGGG                                             GATCCAGCAG GAGACCCTCT GCACATGAGG                                               CTGGTTTACC AACATCTACT CCCTCAGGAT                                             GAGCGTGAGC CAGAAGCAGC TGTGTATTTA                                               AGGAAACAAG CGTTCCTGGA ATTAATTTAT                                             AAATTTAATA AATCCCAATA TAATCCCAAA  

set forth as SEQ ID NO:11.
 11. A purified DNA molecule encoding a mutantof human cyclin dependent kinase with decreased activity, wherein saidDNA molecule encodes a protein comprising the amino acid sequence asfollows:

    Met Asp Gln Tyr Cys Ile Leu Gly Arg Ile Gly Glu G1y Ala His Gly                 Ile Val Phe Lys Ala Lys His Val Glu Thr Gly Glu Ile Val Ala Leu                                                     Lys Lys Val Ala Leu Arg Arg Leu                                              Glu Asp Gly Phe Pro Asn Gln Ala                                                Leu Arg Glu Ile Lys Ala Leu Gln                                              Glu Met Glu Asp Asn Gln Tyr Val                                                Val Gln Leu Lys Ala Val Phe Pro                                              His Gly Gly Gly Phe Val Leu Ala                                                Phe Glu Phe Met Leu Ser Asp Leu                                              Ala Glu Val Val Arg His Ala Gln                                                Arg Pro Leu Ala Gln Ala Gln Val                                              Lys Ser Tyr Leu Gln Met Leu Leu                                                Lys Gly Val Ala Phe Cys His Ala                                              Asn Asn Ile Val His Arg Asn Leu                                                Lys Pro Ala Asn Leu Leu Ile Ser                                              Ala Ser Gly Gln Leu Lys Ile Ala                                                Asp Phe Gly Leu Ala Arg Val Phe                                              Ser Pro Asp Gly Ser Arg Leu Tyr                                                Thr His Gln Val Ala Thr Arg Ser                                              Val Gly Cys Ile Met Gly Glu Leu                                                Leu Asn Gly Ser Pro Leu Phe Pro                                              Gly Lys Asn Asp Ile Glu Gln Leu                                                Cys Tyr Val Leu Arg Ile Leu Gly                                              Thr Pro Asn Pro Gln Val Trp Pro                                                Glu Leu Thr Glu Leu Pro Asp Tyr                                              Asn Lys Ile Ser Phe Lys Glu Gln                                                Val Pro Met Pro Leu Glu Glu Val                                              Leu Pro Asp Val Ser Pro Gln Ala                                                Leu Asp Leu Leu Gly Gln Phe Leu                                              Leu Tyr Pro Pro His Gln Arg Ile                                                Ala Ala Ser Lys Ala Leu Leu His                                              Gln Tyr Phe Phe Thr Ala Pro Leu                                                Pro Ala His Pro Ser Glu Leu Pro                                              Ile Pro Gln Arg Leu Gly Gly Pro                                                Ala Pro Lys Ala His Pro Gly Pro                                              Pro His Ile His Asp Phe His Val                                                Asp Arg Pro Leu Glu Glu Ser Leu                                              Leu Asn Pro Glu Leu Ile Arg Pro                                                Phe Ile Leu Glu Gly,              

set forth as SEQ ID NO:12.
 12. A DNA molecule of claim 10 consisting ofthe nucleotide sequence as set forth in SEQ ID NO:
 11. 13. An expressionvector for the expression of a human cyclin dependent kinase in arecombinant host cell wherein said expression vector comprises the DNAmolecule of claim
 10. 14. The expression vector of claim 13 which isselected from the group consisting of pcDNA3.1:CDK10-D127N andpBBH:CDK10-D127N.